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INTRODUCTION 


Much of Clifford algebra is quite simple minded. If this fact were generally 
recognized, Clifford algebra would be more widely used as a computational 
tool. 

Entire books have been written on the calculus of manifolds. However, 
one does not need to master the contents of such a book before one is able 
to use different coordinate systems to solve problems. Similarly much of the 
material usually covered in books on Clifford algebra is unnecessary for a 
very broad spectrum of applications. 

The applications discussed in this book range from special relativity and 
the rotating top at one end of this spectrum to general relativity and Dirac's 
equation for the electron at the other end. In Chapter 9, we present 
an elementary derivation of the Kerr metric which is the basis for the 
mathematics of black holes. In Chapter 10, we present a second derivation 
of the Kerr metric. This second derivation is more sophisticated but it is 
also more straightforward than the derivation presented in Chapter 9. 

The math prerequisites for this book are the usual undergraduate 
sequence of courses in calculus plus one course in linear algebra. The physics 
prerequisites for most of the book correspond to that covered by an 
undergraduate physics major. However, a few applications discussed in the 
book may require some physics usually covered in the first year of graduate 
school. 

Clifford algebra has become a virtual necessity for the study of some 
areas of physics and its use is expanding in other areas. Some physicists have 
been using Clifford algebra without realizing it. In quantum electrodynamics, 
it has been discovered that many algebraic manipulations involving Dirac 
matrices can be carried out most efficiently without reference to any 
particular matrix representation. This is Clifford algebra. 

In Klein- Kaluza theories and dimensional renormalization theories, one 
must deal with spaces of dimensions other than the four used in Dirac's 
equation. In such situations one can no longer use the usual 4 x 4 matrix 
representations of Dirac matrices and if one wishes to consider spaces of 
arbitrary dimension, one is forced to avoid reference to any particular 
representation. This requires Clifford algebra. Clifford algebra is now being 
applied in a very conscious and explicit manner in the formulation of 
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superstring theories. Thus Clifford algebra has become an indispensable tool 
for those at the cutting edge of theoretical investigations. 

Vector calculus has long been regarded as a universal language of 
physicists. In recent years there have been those who have strongly recom- 
mended that all serious physicists become conversant with differential forms. 
However Clifford algebra encompasses both of these areas of mathematics 
along with tensor calculus. 

Another formalism that has been effectively used and promoted by Roger 
Penrose and others is that of spinors. It is interesting to note that in the 
appendix of their book, Spinors and Space-Time, Vol. 2 (1986, pp. 440—464), 
Penrose and Rindler use the structure of Clifford algebra to generalize the 
notion of spinors that has been used for 4-dimensional spaces with signature 
(+, —, —, —) to spaces of arbitrary dimension. 

The structure of differential forms and tangent vectors is embedded in 
the structure of Clifford aigebra with only slight modifications. Thus 
advanced readers who already have some mastery of differential forms should 
find the content of much of this book to be familiar territory. 

Actually, in the context of Clifford algebra, the formalism of differential 
forms and tangent vectors can be substantially simplified. In the usual 
formulation, dx’ and 0/0x/ are presented as coordinate bases of dual spaces. 
The distinction between these spaces is necessary when no metric is given. 
However, for most physical applications, one needs to introduce a non- 
singular metric which generates an isomorphism between these spaces. 
Spaces that are isomorphic are essentially identical. In the formalism of 
differential forms as usually presented, one does not take advantage of this 
fact. Even in the presence of a metric, unnecessary distinctions are main- 
tained. This creates a multitude of products, mappings, and spaces which 
require a considerable amount of bookkeeping in the notation. 

In the usual formalism, the tangent vector 0/0x! is treated as the image 
of the differential form g,, dx' under an isomorphic mapping. In the structure 
of Clifford algebra, the analogous relation becomes a simple equality: 
7, = g Y’. This simple difference in the two formalisms enables one to carry 
out computational manipulations in Clifford algebra which would either be 
awkward or illegal in the usual formalism of differential forms. All products 
that appear in Clifford algebra can be readily grasped by anyone familiar 
with matrix multiplication. 

One important feature of this book is a substantial discussion of 
Fock-Ivanenko 2-vectors (Fock and Ivanenko 1929; Fock 1929). These 
2-vectors were introduced by Vladimir Fock and Dimitri Ivanenko to 
make Dirac's equation for the electron compatible with the dictates of 
general relativity. Using these 2-vectors, denoted by L,, Dirac's equation 


becomes 
Ó ie imc 
y* -T,+— A, = —— Ų. 
i (4 he )« nY 


In this context, the T,’s may be regarded as components of a gauge field. 
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It has been said that gravitational fields are not Yang- Mills fields but 
it is interesting to note that 


0 Ó 
cae cg lc LEURS 398, 





where #,, is the curvature 2-form. Furthermore, using the formalism of this 
text, a slightly generalized version of Einstein's field equations can be cast 
in the form of a Yang-Mills equation. Namely 


1 


War Vly — $0") = F". 


This is equivalent to 
R^ y" ^ y = 8n(T*, ve pHi ee A y" 


where the R*,s are components of the Ricci tensor and the 7*,'s are 
components of the energy-momentum tensor. This form of Einstein's field 
equations admits the possibility of a nonzero cosmological constant. 

I have found that the Fock-Ivanenko 2-vectors can be used to expedite 
the computation of both the Schwarzschild and the Kerr metric. These 
computations are carried out in this text. 

Different readers of this book will have different interests. Some will be 
interested in some applications but not in others. Some readers may wish 
to read the mathematical portions and omit some or even most of the 
applications. This is generally possible. Several sections of the book are 
unnecessary for the understanding of the succeeding contents of the book. 
These sections have been designated with asterisks in the table of contents. 
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A TASTE OF CLIFFORD ALGEBRA IN 
EUCLIDEAN 3-SPACE 


1.1 Reflections, Rotations, and Quaternions in E? via Clifford Algebra 


One usually represents a vector x in a 3-dimensional Euclidean space E? 
by x =x! + x?j + x3k or (x!, x?, x3) = x1(1, 0, 0) + x (0, 1, 0) -- x*(0, 0, 1). 
However there are many alternative representations that could be used. For 
example, we could write 


x= xP x xt (1.1) 
where 
0001 i. 01 0 
= [oot 0 . [o 6; -=i 
AT sae esl Suas dpud SUE 
1000 0—10 0 
and 
y 0) 0 
p De EC 0o x8 
Hlc d Tn 
00 0 -i 


At first sight this may seem to be a pointless variation. However, 
representing a vector in terms of these square matrices enables us to multiply 
vectors in a way which would not otherwise be possible. We should first note 
that the matrices ?,, ?,, and $4 have some special algebraic properties. In 
particular 


(71)? = CV =Q = I, (1.3) 
3 
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where I is the identity matrix. Furthermore 
3293 + 3332 = 7391 + 9193 = 39132 + 2231 = 9. (1.4) 


A set of matrices that satisfy Eqs. (1.3) and (1.4) is said to form the 
basis for the Clifford algebra associated with Euclidean 3-space. There are 
matrices other than those presented in Eq. (1.2) which satisfy Eqs. (1.3) and 
(1.4). (See Problem 1.2.) In the formalism of Clifford algebra, one never deals 
with the components of any specific matrix representation. We have intro- 
duced the matrices of Eq. (1.2) only to demonstrate that there exist entities 
which satisfy Eqs. (1.3) and (1.4). 

Now let us consider the product of two vectors. Suppose y = y!?, + 
y^); + y?),. Then 
fafa 
x43 + xy? Pips + xt ype + x^ y Poh. (1.5) 


xy = (xty! + x?y? + xy + xy? + xy 


Using the relations of Eq. (1.4), we have: 


xy = (xly! + x?y? + xy) + (xy? — x? yu, 
+ (x3y? — xty?)p39, + (y? — x? yy 4. (1.6) 


(Note: xy # yx.) 

We notice that the coefficient of I in Eq. (1.6) is the usual scalar product 
Cx, yy. Furthermore the coefficients of },)3, 7,93, and 7,2, are the three 
components of the cross product x x y. 

By considering all possible products, one obtains an 8-dimensional space 


spanned by {, Îi» Pas Pas 2233. Pat Filo 419233); where 


0 0 —1 0 0 0 0 1 
0 0 0 1 - 0 0 1 0 
7273 = 1 ol 7371 = ü -100 
0 —1 0 0 —1 0.0 0 
0 —1 0 0 0 =i 0 0 
"m 1 0 0 0 — 1 0 0 0 
A2 0 o0 exl and V1%273 = 0 0 1 
0 0 1 0 0 0 —1 0 


One might think that one could obtain higher order products. However, any 
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Fig. 1.1. Vector x’ 1s the result of reflecting vector x with respect to the plane perpendic- 
ular to a. 


such higher order product will collapse to a multiple of one of the eight 
matrices already listed. For example: 


In this fashion, we have obtained an 8-dimensional vector space that is 
closed under multiplication. A vector space that is closed under multiplica- 
tion is called an algebra. An algebra which arises from a vector space with 
a scalar product in the same manner as this example does from E? is called 
a Clifford algebra. (We will give a more formal definition of a Clifford algebra 
in Chapter 3.) 

The matrices },, f2, and ?4 are known as Dirac matrices. Any linear 
combination of Dirac matrices is a /-vector. A linear combination of 7,73, 
fafi, and $,?, is a 2-vector. In the same vein, a multiple of I is a 0-vector 
and any multiple of 7,2594 is a 3-vector. 

A general linear combination of vectors of possibly differing type is a 
Clifford number. 

It will be helpful to use an abbreviated notation for products of Dirac 
matrices. In particular let 


723 R Î23 9351 i 931 PiP2 = 9i» and 312293 = fiz- 


The algebraic properties of Clifford numbers provide us with a con- 
venient way of representing reflections and rotations. Suppose a is a vector 
of unit length perpendicular to a plane passing through the origin and x is 
an arbitrary vector in E? (see Fig. 1.1). In addition, suppose x' is the vector 
obtained from x by the reflection of x with respect to the plane corresponding 
to a. Then 


x'— x — 2a, x»a. (1.7) 
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Fig. 1.2. When x 1s subjected to two successive reflections first with respect to a plane 
perpendicular to a and then with respect to a plane perpendicular to 5, the result 1s a rotation 
of x about an axis ın the direction a x b. The angle of rotation ıs twice the angle between a and b 


From Eq. (1.6), it is clear that 

ax + xa = 2a, x»l. (1.8) 
Using this relation, Eq. (1.7) becomes 

x’ = x — axa — x(a)’. (1.9) 


However, using Eq. (1.6) again and the fact that a is a vector of unit length, 
we have 


(a = (a, ol — I. 
Thus Eq. (1.9) becomes 
x’ = —axa. (1.10) 

A rotation is the result of two successive reflections. (See Fig. 1.2.) 

From Fig. 1.2, it is clear that if x” = —bx'b = baxab, then x" is the 
vector that results from rotating vector x through an angle 24 about an axis 
with the direction of the axial vector a x b. We can rewrite this relation in 
the form: 


x" = RIR where | 2? -— ba. (1.11) 


It is useful to explicitly compute the product ba and interpret the separate 
components. If 


a =a f, + a?) + a°; 
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and 
b= bj, T b^j, + b?j5, 
then from Eq. (1.6) 
R = ba = (a, byI — (a x b) j,, — (a x b?3, — (a x 5) 49. 
Since both a and b are vectors of unit length (a, b» = cos y. Furthermore 
the magnitude of a x b is sin y. For this reason, in the usual row matrix 


representation: 


a x b= ((a x b), (a x b», (a x b) = sin (n, n?, n?) 


where b!, n?, and n? are the direction cosines of the axial vector a x b. With 
this thought in mind, we have 


R = I cos V — sin y(n!j,4 + n?j4, + 1?$,,). 


We should note that y represents 2 the angle of rotation. If 0 is the 
actual angle of rotation, we then have 


0 0 
R = I cos 2 sin 5 (n°923 + n7$3, +n°9,9). (1.12) 


To obtain R~! from &, one can replace 0 by —0 or reverse the order 
of the Dirac matrices. In either case 


0 . 0 
RT! =I cos zt sin 5 ("23 +793, +n°9,2). (1.13) 


Readers who are familiar with quaternions should note the striking 
resemblance between the right-hand side of Eq. (1.12) and the rotation 
operator written in terms of quaternions. In the theory of quaternions 
introduced by William Hamilton in 1843, the same rotation operator written 
as a quaternion appears in the form: 


0 cei s 
R = 1 cos + sin - (n'i + on^j om). (1.14) 
Indeed, we can identify i, j, and k respectively with —y;,4, —y3,, and 
— 712. The algebraic relations which define i, j, and k are 
Gy = Gy - (Ky = —1, (1.152) 
jk= —kj =i, ki = —ik — j, (1.15b) 


j= —ji=k. (1.15c) 
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The same equations written in terms of the 2-vectors associated with E? 
become 


(72337223 = C^ $30€— 230 = C 29123€- 312 = —1, — (1.162) 








(733067512 = (F12)¢ Jai) ay ( 223). (1.16b) 
(—7413Y(7223) = — C7 923—912) = C23). (1.16c) 
(~ P23 723) = —(—53)(€7 223) xs (—512) (1.16d) 


In Hamilton's formulation, a vector x is represented as xti + x?j + x^k 
and the rotated vector x" is computed by the quaternion version of Eq. (1.11). 
However the Hamilton approach fails to make the distinction between an 
ordinary vector and an axial or pseudo-vector. 

An example of a pseudo-vector in E? is the cross product a x b formed 
from the composition of two ordinary vectors a and b. In standard 
elementary treatments of vectors in E?, no distinction is made between an 
ordinary vector and a pseudo-vector. This is largely due to the fact that a 
pseudo-vector in E? has the same number of components as an ordinary 
vector. Furthermore both behave in the same fashion under a rotation. 

However, under a reflection, the two entities behave quite differently. 
For example suppose we consider a reflection with respect to the yz-plane. 
Let 


a-(ala?,a), | b (bl b?, b^), 
and 
a x b = (a?b? — ab’, ab! — atb’, a! b? — a?b!). 
After reflection with respect to the yz-plane, the reflected vectors a’ and b' 
may be obtained respectively from a and 5 by changing the sign of the first 
component in each vector. That is 


a —(—al,a^,aà) and P =(—bt,b?, b’). 


On the other hand 





a’ x b = ((a?b? — a?b?), — (ab! — a'b?), — (atb? — a?b)). 





Thus in contrast to the ordinary vectors a’ and b’, we see that it is the 
sign of the second and third components which have reversed sign for the 
pseudo-vector a’ x b'. 

In the context of Clifford algebra, this distinction between ordinary 
vectors and axial or pseudo-vectors is automatic. In the formalism of Clifford 
algebra, ordinary vectors appear as 1-vectors and cross products appear as 
2-vectors. A similar distinction is made between scalars such as (a, b» which 
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do not change sign under a reflection and pseudo-scalars such as (a, b x c» 
which do change sign under reflection. In the formalism of the Clifford 
algebra associated with E?, scalars appear as 0-vectors and pseudo-scalars 
appear as 3-vectors. 

Even in the context of Clifford algebra, some care must be taken in the 
computation of reflections. For example, suppose a and b are 1-vectors and 
a' and b' represent the reflections of a and b with respect to the yz-plane. Then 


a’ = —), a). (1.17) 


The cross product of a and b can be represented by the antisymmetric 
product 


(ab — ba) = (a?b? — a?b?)$,, + (aà)b! — a!b?),, + (alb? — a?b3)5,,. 
We observe that 
a'b —(—)149)(— 9,591) = fab? . 
Since a similar relation holds for the product b’a’, we have 
a'b — ba^) = 9, X(ab — ba)f,. (1.18) 


We note that Eq. (1.8) has the same form as Eq. (1.7) except for a 
difference in sign. 

The problem of the choice of sign that appears with reflections does not 
occur with rotations. This is true even in higher dimensional spaces where 
vectors of higher order can occur. Suppose a, 4&2, . .., a, are all 1-vectors. If 


a, = Ra, Rt fork =1,2,...,p 
then 
4,058, ...d, = (Ra, R ARAR *)... (Ra, R *) 
= R(a,a za; ... a )R t. 


In closing this section, we should note that in the formalism of Clifford 
algebra, the rotation operators are double valued. That is if x' = 22x29! 
then 4? can be replaced by — 22. This point will be pursued a little further 
in the next section. 

It should also be noted that the idea of decomposing a rotation into a 
product of reflections can be extended to n-dimensional Euclidean or 
pseudo-Euclidean spaces. For such spaces it can be shown that any rotation 
is the product of an even number <n of reflections. This theorem was first 
proved by Élie Cartan for both the real and the complex domains (1938, pp. 
13-17; 1966, pp. 10—12). This result was later extended to vector spaces over 
scalar fields with characteristic #2 by Jean Dieudonné (1948, pp. 20-22). 


10 CLIFFORD ALGEBRA 

Problem 1.1. From the form of Eq. (1.11), it is clear that if the rotation 
operators 4? and & represent two successive rotations, then the combined 
rotation is presented by the product 22'22. Use this fact and Eq. (1.12) to 
show that a 90? rotation about the x-axis followed by a 90? rotation about 
the y-axis is equivalent to a 120° rotation about the axis which has the 
direction of the vector (1, 1, —1). 

Problem 1.2. There are many representations that can be used for 5,, fa, and 


$4. One convenient representation is that using Pauli matrices o4, o;, and 
64. That is, we can let 


and 


N |. d 
^ = 0 = 3 
73 3 0 i 


Show that in this representation Eq. (1.3) and Eq. (1.4) are satisfied. 


Problem 1.3. In this representation introduced in Problem 1.2, the quater- 
nions i, j, and k are represented by complex 2 x 2 matrices. In particular 


) , ! 0 -i 
p -m= in| | 
: . i 0 —1 
j= -1 = -in =| 3 


z . ! a x 
= — f}, = —104— : 
712 3 0 i 


In this representation the rotation operator 


and 


0 0 
R=! cos > + sin 5 (in! + jn? + kn?) 





ME: VUES S 3 
COS in” sin sin Tan 
2 2 2” ) 
E 0 0 
sin 5 (n? — in!) cos > + in? sins 


Show that, in this representation, the matrix representing 2 is unitary and 
has determinant equal to 1. (From this result, it is clear that the algebraic 
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properties of the double-valued rotation operators can be ascertained by 
studying the algebraic properties of 2 x 2 unitary matrices whose deter- 
minant is 1. For this reason the group of double-valued rotation operators 
is labeled SU(2). The letter U indicates “unitary”. The letter S indicates 
“special” which in the context of group representation theory means the 
determinant equals 1.) 


Problem 1.4. Suppose 


clos x uA. 
2 2 


where 





" ; ; " 2,5 5 
û — nli + n’ j++ ik = —n'$,4 — nfs n^9;5. 


Show that exp[A(6/2)] = 2. Hint: represent exp(A[(6/2)] by a Taylor's series 
and then separate the odd and even powers of fi. 


1.2 The 4r Periodicity of the Rotation Operator 


From the consequences of the last section, we see that if the vector x(0) 
represents the result of rotating vector x(0) through an angle 0, then we can 
represent the rotation in the form: 


x(0) = 20(0)x(0) R- (0) (1.19) 


where 
0 0 
&R(0) = I cos - — Â sin -, 
2" 2 
â = n'ag + n^), + 07912, 


and nt, n’, along with n?, are the direction cosines of the axis of rotation. 

Although x(0) has a period of 27, (0) has a period of 4x! With the 
development of quantum mechanics in the 1920s, it became recognized that 
4r periodicities do occur in nature. To explain the observed structure of the 
hydrogen energy spectrum, it was necessary to attribute to the electron a 
spin of $ and a periodicity of 4r. In recent years, it has become more widely 
recognized that objects larger than electrons also have 4m periodicities 
(Bolker 1973). A demonstration of this fact has been put forward by Edgar 
Riefin (1979). 

For an object to display a 4x periodicity it is necessary that it be in 
some sense loosely attached to its surroundings. 

To illustrate this, you may wish to carry out a demonstration. First hold 
a glass of water in the palm of your hand. The hand holding the glass may 
be left or right but it is important that your hand be under the glass with 
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(a) 0 (b) 100° 





(c) 180° (d) 250° 


Fig. 13. (a)-(d). 


palm up. Then maintain a firm grip on the glass and rotate it 360° without 
moving your feet or spilling any water. You may do this in either the 
clockwise or counterclockwise direction. When you have completed this 
maneuver, you will find yourself in an awkward position with the glass 
slightly above your head and your elbow pointed upward. Clearly the 
relationship of the glass to you is quite different from what it was in its initial 
position. However if you continue the rotation, you may be surprised to find 
that your arm will unwind itself and the glass will return to its initial position 
with its initial relationship to you. 

Thus the glass attached to your arm does not have a 2r periodicity but 
it does have a 4r periodicity. 

This demonstration is shown in Fig. 1.3 where a book is used in place 
of a glass of water. 


13 The Spinning Top (One Point Fixed)—Without Euler Angles 


(The reader may omit this section without sacrificing his or her ability to 
comprehend the rest of this book.) 
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(e) 430 (f) 570 





(g) 650 (h) 720° 


Fig. 1.3. (e)-(h) A book which does not have a 2n periodicity but does have a 4n periodicity. 


One useful application of Clifford algebra is the spinning top problem 
with one point fixed. One of the most significant difficulties with this 
problem is that of parameterizing the rotational motion. It is somewhat 
awkward to write down the matrix representing a given rotation unless the 
rotation happens to be around the x, y, or z axis. On the other hand, any 
rotation in Euclidean 3-space can be decomposed into a succession of 
three rotations about coordinate axes. It is for this reason that the required 
rotational transformations for the spinning top are usually expressed in terms 
of Euler angles. 

A drawback to the use of Euler angles is the fact that not all the Euler 
angles are intrinsic to the motion of the top. Despite claims to the contrary, 
none of the Euler angles corresponds to the spin of the top about its own 
axis. Using Clifford algebra, it is easy to represent a rotation about an 
arbitrary axis. This enables one to introduce a more intrinsic set of 
coordinates. 

To discuss the problem of the motion of a rigid body about a fixed 
point, one must not only be able to express the coordinates of a point of the 
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top in body 
coordinate 
system 





Xp YB 


Fig. 1.4. The top in its own coordinate system. 


body in the spatial coordinates of the observer, but also in the body 
coordinate system which is attached to the rigid body. This can be done 
conveniently using the formalism of Clifford algebra. 

If the bottom tip of the top's axis of symmetry is stationary and 
coincident with the origin, it is possible to describe the position and 
orientation of the top at any instant in the spatial coordinate system of the 
observer in terms of three angles. I will refer to the magnitude of the spin 
of the top about its own axis as the spin angle and designate that angle by 
w(t). (See Figs. 1.4 and 1.5.) I will refer to the angle between the vertical 
z-axis and the top's axis of symmetry as the tilt angle and designate that 
angle by 0(t). (See Fig. 1.6.) Finally, I designate the angle of precession by 
$(t). The angle of precession is understood to be the angle between the 
xz-plane and the plane spanned by the top's axis and the z-axis. (See Fig. 1.6.) 

In the body coordinate system used here, the top is stationary and the 
axis of symmetry is coincident with the z-axis and with the bottom tip of 
the top coincident with the origin. 

To transform from the body coordinate system to the spatial coordinate 
system, we need the operator which will physically rotate the top from this 
stationary vertical position to the position and orientation of the top at some 
instant of time as seen by an observer. This rotation is somewhat complex 
but it may be considered to be the composition of two fairly simple rotations. 
The first is the rotation of the top by the spin angle y(t) about the z-axis. 
The second is the rotation by the tilt angle 6(t) in the plane spanned by the 
z-axis and the axis of the top in the position as seen by the observer. The 
axis of rotation for this second rotation lies in the xy-plane with direction 
cosines ( — sin @, cos $, 0). (See Fig. 1.6.) 
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X 


Fig. 1.5. The top rotated by angle y about its own axis. 





top in spatial 
coordinate 


Fig. 1.6. The top tilted by angle 0 from the z-axis. The plane containing the z-axis and the axis 
of the top makes an angle with the xz-plane That angle is designated by $ and denotes the 
angle of precession. 
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The first rotation operator is simply 
Rspin = I cos y — $15 sin A (1.20) 
2 2 
The second rotation operator is 
Oi kos ot, ADE 2 ER 
Rir = I cos 7 +3 sin 2 sin $ — 73, sin 5 cos $. (1.21) 


If we designate an arbitrary point in the top as seen in the spatial 
coordinates of the observer by x and the same point in the body coordinate 
system by xg, we have 


x= Rx,R™ (1.22) 

where 
R= Ry rA spy. (1.23) 
To obtain the equation of motion for a rotating top, we need to obtain 


expressions for both the kinetic energy T and the potential energy V. The 
kinetic energy of a rigid body is represented by the integral 


T- ; | p(xY&y? dV, (1.24) 


where p(x) is the mass density at the point x and x is the time derivative of x. 

To compute 7, it is a virtual necessity to convert the variables of 
integration to the body coordinate system. With this thought in mind, let 
us now set out to compute (x)? in the body coordinate system. From Eq. 
(1.22), we have 


X= xy + RXR! 
or 
i= RIR Rx, + XRR'R)R™. (1.25) 


(Note: 22^! is the time derivative of 22^! and not the inverse of 22 which 


may not exist.) : i 
Since A'R = 1, it follows that RIR + RIR — 0. Using this 
relation to eliminate 22^ 122 in Eq. (1.25), we have 


t= RIR x,— xy P) 
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and 
(4)? = RR'Rxy, — pR RPR! 
or 
gp (xy =(RNRx, — pR RYP. 
Since (X)? is a scalar, it commutes with 22, and this last equation becomes 
(3)! = (R' Rx, — xy RY. (1.26) 


To continue our computation of (x)? in terms of the body coordinates, 
we are now faced with the task of computing 22^ +R. From Eq. (1.23), 


7 = D MEC ze Rout Bsr 
and 
R! = Rp TET 
Therefore 


2 !g = Rei Rit Brr) Bsr * Ren Rep: 


Completing this grubby calculation, one gets 


RR = —3(P23@1 + $3105 + $1503). (1.27) 
where 
0, = —dsin 0 cos(j — $) + Ê sin(y — $), (1.28) 
05 = $ sin 0 sin( — $) + Ê cos(j — $), (1.29) 
(4 = V — d(1 — cos 0). (1.30) 


From Eqs. (1.26) and (1.27), we get 


E) = o) + (03)* + (03)? (xp)? — Y, 050, xx. (1.31) 
dk 


Substituting this result into Eq. (1.24) gives us 


1 7 
T 2-3. I*o,o, 
2 Tk 
where 


pue [oeste — (x4) dV 


18 CLIFFORD ALGEBRA 
and 


I*- - [ssi dV for j Æ k. 


The I/*s are constants known as moment of inertial coefficients. If the 
top is cylindrically symmetric, we can identify the axis of symmetry with the 
z or x? axis of the body coordinate system. Because of the cylindrical 
symmetry it is not difficult to show that I/* = 0 if j # k. For example 


Pts y's = SEV + y?, z)xz dx dy dz 


= -ff foe z)r?z cos 0 dr d0 dz 
-[ cos 040 | [pr z)r?z dr dz = 0 
0 


In a similar fashion, we can show that J‘! = I??. If we redesignate I?! 
and I?? by I, and 1?? by I, we have 


ll 


= dM o) + (03)^] + 315(@3)?. 


Using Eqs. (1.28), (1.29), and (1.30), this becomes 
= 41,[(¢)? sin? 6 + (6)7] + Àl[ — 61 — cos 0)". — (1.32) 


To obtain the equations of motion, we also need an expression for the 
potential energy V. 

If we designate the total mass of the top by M and the distance of the 
center of mass from the fixed point along the axis of symmetry by h, then 
the potential energy is V = Mgh cos 0. (See Fig. 1.7.) 

The Euler-Lagrange equations of motion are of the form 


d @ 0 
————|(T—V)=0 where y — y, $, or 0. 
(io F dc ee 


The first two of these equations are 


d : ; d 
de LEY — à(1— cos0)] = E? (13003) = 0, (1.33) 





M sin? 0) ins $ —cos8)][1 — cos 0]) 2 0. (1.34) 
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zZ 







Center 
of 
gravity 





Fig. 1.7. The height of the center of gravity ıs equal to h x cos 8. 


From Eqs. (1.33) and (1.34), we get two constants of motion: 
p, = LE} — ó(1 — cos 0)] = 1505, (1.35) 
D, = Lọ sin? 0 — I04(1 — cos 0). (1.36) 
Furthermore the total energy is constant: 
E — À[()? sin? 0 + (0)?] + 41,(@3)? + Mgh cos 0. (1.37) 
If we use Eq. (1.36) to eliminate à from Eq. (1.37), we get 


[ps + 1303(1 — cos 6)]? 
21, sin? 0 





31,(6) = (E — 41;(@3)’) Mgh cos 0. 


Multiplying this last equation by sin? 0 or 1 — cos? 0, we have 


41 (sin 00)? = [E — 41,(@3)7](1 — cos? 0) 
— [po + Iso3(1 — cos 0) /Q1) 
— Mgh cos 0(1 — cos? 0). 
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If we let u = cos 0, we then have à = — (sin 0)0 and 


Q^)? = f(u), (1.38) 
where 
f(u) = I,[2E — Llw} — 2Mghuy(1 — u’) — [py + 1504 — Lou]. (1.39) 


In principle, Eq. (1.38) can be solved for u or cos 0 as a function of time. 
That result can be used in Eq. (1.36) to obtain a solution for $(t). Finally 
one can then use Eq. (1.35) to get a solution for y(t). 

Let us discuss one example. In the case of a toy top, you generally hold 
the top in some fixed position while you pull a string wrapped around the 
axis. When the string is pulled away from the top, the top is released. In this 
case the initial values of Ê and ¢ are zero. If we use a zero subscript to label 
the initial values of our variables then 0 = úo = ĝo = 0. 

From Eqs. (1.37) and (1.36) 


2E — I4(03)? = 2Mghuy and Do  I3003 = I303ug. 
Substituting these relations into Eq. (1.38) gives us 
(,) (à? = 2Mgl,h(ug — u)(1 — u?) — (103) (uo — uy. 
If we let 
2Mgl,h = a(I404)?. (1.40) 
we then have 
(LY (0) = 40$4)*(ug — uy —au? + u — (ug — a)]. (1.41) 


The roots of the last factor on the right-hand side of Eq. (1.41) are 


LLEVE dai =a) 


20 





If the top has a high rate of spin then 
a = 2Mgh/(I40,)? « 1. 


Using this approximation and then using the first few terms in the Taylor 
expansion of (1 — 4a(ug — «))*, our roots become 


Uy = ug — o(1 — (uo)”), (1.42) 


uy = 1/a — ug. (1.43) 
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Fig. 1.8. The cubic polynomial p(u) = (ug — u)[— au? + u — (ug — &)] = (ug — u)(u — uy) x 
(u, — u), where u = cos 0, u, zz ug — a[1 — (ug)? ], uz & 1/a — ug, and a = 2Mgh/(I304)? « 1. 


With these approximations, we have 
—a«u? + u — (ug — &) = (u — u4Y1 — a(u + ug)). 
Equation (1.41) can then be rewritten as 


Q4) Gi = (393) (uo — uu — uy )(L — a(u + uo) 
= (IGyo3)^p(u). (1.44) 


Since the left-hand side of Eq. (1.44) is nonnegative, physically meaningful 
values of u must be restricted to values for which the cubic polynomial p(u) 
is positive. (See Fig. 1.8.) This means that 


—-l<u,<u<u<l. 
Recasting Eq. (1.44) as an integral, we have 


E L |" dx 
© 133 Juo y/o — x)(x — uu — ax + ug) 





t 





Initially u decreases as the upper tip of the axis dips. Thus for small values 
of t, we must choose the minus sign. After u attains its minimum value, it 
then begins to increase and we then have 


ENS I, iE dx 
"o 63 Ju Suo — x) — ui) — ax + ug) 
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where 
I op dx 
1503 Ju, (ug — X(x — uY1 — a(x + ug) 














If we continue to use the high-spin approximation that æ « 1, then we can 
replace x by zero where it appears explicitly in the integral. Note: a non-zero 
value of « remains implicit in our approximation for u,. 

Using this approximation we can carry out the integration explicitly. 


One obtains 
I 2u — uy — 
[ue are sin( EM 3) z] 
1,03 ug — Uy 2 


— I 
u = cos @ = © = + oe! cos( = r); (1.45) 
2 2 L 





or 





This gives us the up and down motion of the upper tip of the axis. This 
motion is known as nutation. To obtain the rate of precession we note that 
from Eq. (1.36), 


L$C — u?) + I93u = 1,03Up 
or 


; _ 1305 (uo — u) 


3 1.46 
I, 1-w D 





To get a more meaningful expression for @, it is useful to use Eq. (1.42) to 
modify Eq. (1.45). The result is 





ge ; T (uf) 1 = cos( 28 JI (147) 


1 


In addition, it is not too difficult to show that 


IU cpu O(a). (1.48) 
1—u? 


Incorporating Eqs. (1.47) and (1.48) into Eq. (1.46) and retaining only the 
lowest power of g, we get 


$ = awaa | — cos (=s j| 
21, I, 
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u = Ug = COS 65 


u= uy = cose, 





Fig. 1.9. Path of a point on the axis of rotation for a spinning top where the fixed point of the 
top is at the center of the sphere and the distance from the fixed point to the observed point 
1s equal to the radius of the sphere. 


Using Eq. (1.40), this becomes 


$- Mon | — cos( 308 JI (1.49) 


1,03 1 





There are two things worth observing about Eq. (1.49). First: the angle 
$ is monotonic increasing or decreasing depending on the sign of o. 
Furthermore à = 0 when t = 2nI,n/(1,@3) where n is any integer. This 
occurs each time u = ug. The result is a hesitating jerky motion for the upper 
tip of the top. The path of the tip is illustrated in Fig. 1.9 where the lower 
tip of the top is at the center of the sphere. It should be noted that for a 
high rate of spin, the amplitude of the nutation is so small that visually it is 
observed only as a vibration. 

Second: when œw, decreases due to friction, the average rate of procession 
increases! 

A discussion of other solutions for the gyroscope appears in Classical 
Mechanics by Herbert Goldstein (1980). That text also contains a good 
bibliography. 


Problem 1.5. Consider the rotation operator R = By 1: J/,, defined by 
Eqs. (1.20), (1.21) and (1.23). Show that it can be computed alternatively by 
first carrying out the tilt and then rotating the top through the spin angle 
V about the axis with the direction (cos $ sin 6, sin $ sin 0, cos 0). 


Problem 1.6. In elementary physics courses, one frequently sees an ele- 
mentary solution for the rotating top derived from the simple notion that 
the rate of change for the angular momentum is equal to the torque due to 
gravity. This solution relies on a sometimes hidden assumption that the total 
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angular momentum has a direction coincident with the axis of symmetry for 
the top. This solution contains no nutation. 

The condition for no nutation is that the cubic polynomial in Eq. (1.39) 
must contain a double root. That is equivalent to the requirement that 


fuo) = 0 and f'(ug) = 0. 
Show that if one eliminates E from those two equations and then uses 
Eq. (1.36) to eliminate p,, one arrives at the equation 
I,uo(¢)? — 13049 + Mgh = 0. (1.50) 
Show that if one makes the high-spin approximation that 
Mgl,h/(10503)? « 1, 


then one of the roots for Eq. (1.50) is 


_ Mgh 


1,03 


Compare this result with Eq. (1.49). Compare this result with the solution 
that appears in more elementary texts. 


2 


A SAMPLE OF CLIFFORD ALGEBRA IN 
MINKOWSKI 4-SPACE 


2.1 A Small Dose of Special Relativity 


When the speed of sound is measured, it is found that the speed is 
independent of direction only if it is measured with respect to the air. If the 
air is moving at a rate of 20 kilometers per hour, an observer on the ground 
will discover that sound moving in the direction of the wind will move 20 
kilometers per hour faster than it would when the air is still. Similarly sound 
moving against the wind will be slowed down. 

During the nineteenth century, it was generally believed that light 
traveled through some kind of “ether” in much the same way as sound 
travels through air. In 1881, in an effort to measure the velocity of this ether 
with respect to earth, Albert Michelson designed an experiment which would 
compare the speed of light in different directions (1881). The result of the 
experiment was that the speed of light was the same in all directions. The 
experiment was refined and repeated six years later by Albert Michelson and 
Edward Morley, but the result was the same (1887). 

It might be argued that at the time the measurements were taken, the 
earth was moving downstream in the same direction and speed as the ether. 
However, in the course of a year as the earth moves around the sun, the 
earth is hurtling through space in different directions and Michelson and 
Morley got the same null result at different times of the year. 

To this day there are some holdouts who argue that the ether is dragged 
along by the earth. However, the overwhelming majority of physicists have 
abandoned the ether concept. 

Aside from the result of Michelson and Morley, there were also some 
anomalies that appeared in the study of Maxwell's equations. According to 
Maxwell's equations, a magnetic field is generated by a moving charge. But 
what about an observer who moves with the same velocity as the charge? 
For such an observer, the charge is stationary. However, if the magnetic field 
exists, such an observer should be able to measure it. 

To resolve this kind of paradox, some mathematically inclined physicists 
investigated the mathematical symmetries of Maxwell's equations. Hendrik 
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A. Lorentz (1904) Henri Poincaré (1905), and Albert Einstein (1905) 
published separate papers that presented a set of equations which have since 
become known as the Lorentz transformation. 

It was Einstein who saw the physical consequence of these equations. 
For this reason, he is generally credited with the introduction of the special 
theory of relativity. 

The Lorentz transformation can be derived from the assumption that 
the speed of light in a vacuum is a constant independent of direction for any 
observer who is moving at a constant speed relative to the source. Suppose 
we consider a spaceship flying at low level past a team of observers on top 
of Mt. Everest. Suppose further that the crew of the spaceship and the team 
of ground observers have previously agreed to set their clocks at t = 0 at 
the instant the spaceship swoops past the highest point of Mt. Everest. Also 
let us assume that at the instant that the time is zero in both the spaceship 
coordinate system and the earth coordinate system, a flashbulb is set off in 
an open window of the spaceship. 

If the crew of the spaceship then monitors the position of the front edge 
of the expanding light wave, they will find that they are dealing with an 
expanding sphere whose radius is increasing at the speed of light c. Thus if 
(t, xt, x?, x?) represents the coordinates of a point on the expanding sphere 
in a 4-dimensional space-time frame, then 


(ct)? — (x)! — (x7)? — O°)? « 0. Q.1) 


For the team of observers on the ground who also monitor the 
wavefront, the result is the same. That is, in the coordinate system of the 
earth-bound observers, the wavefront is also a sphere expanding at the same 
speed. Thus if (t, x', X°, x?) represents the coordinates of a point on the 
wavefront in their coordinate system, then 


(ct)? — Gy, — Gy! — (y! = 0. (2.2) 





Suppose the orientation of the two coordinate systems agrees at time 0 and 
the spaceship is moving with velocity v along the x!-axis. According to 
Newtonian mechanics, the relationship between the two coordinate systems 
is known as the Gallilean transformation: 


Eg (2.3a) 
X! =x! + ot, (2.3b) 
x? = x?, (2.3c) 
dix (2.3d) 


However, this system of equations is inconsistent with Eqs. (2.1) and 
(2.2). 
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In the first chapter we considered rotations in Euclidean 3-space. One 
feature of rotations is that they preserve the length of a vector. In some sense, 
we can introduce a generalized rotation in our space-time system which 
will do much the same thing. If s = (t, xt, x”, x?) represents a vector in 
Minkowski 4-space then it is useful to consider the "square of its length" 
to be 


(5,5) = (ct? — Gy? — (7)? — (oy. (2.4) 





Certainly if we could find a transformation which preserved such lengths, 
then Eq. (2.2) would follow from Eq. (2.1) and vice versa. 

To retain the equivalence of Eqs. (2.1) and (2.2), it would not be 
necessary to restrict oneself to length-preserving transformations. One could 
combine a length-preserving transformation with the following: 


t= kt, 
x! = kx’, 
x? = kx?, 
X = kx’. 


However, according to Einstein, neither of the coordinate systems should be 
preferred and thus 


t=kt=(k)*t, x! = kã! = (k}?x!, etc. 


For this reason k? = 1, and k = +1. However, for either value of k, the 
resulting transformation would be a length-preserving transformation. 

Length-preserving transformations include reflections but if the speed of 
our spaceship is decreased to zero, our transformation should reduce to the 
identity transformation. This is not possible with a reflection or with the 
product of an odd number of reflections. Therefore, if we stick with linear 
transformations, we are left with the possibility of a generalized rotation. As 
we have already seen, the formalism of Clifford algebra is particularly useful 
to handle rotations. Suppose we represent a vector s in Minkowski 4-space 
by the Clifford number 


sS = Poct + jx! + jx? T ja (2.5) 

If we construct a Clifford algebra analogous to that used for Euclidean 

3-space, we would like to have (s)? correspond to 4s, s». If we impose this 
requirement, then 


(s) = Ks, s) = I[(et)? — (xy? — G?y' — (x9). (2.6) 


However, for this equation to hold, we need a set of Dirac matrices with 
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somewhat different algebraic properties from those used for Euclidean 
3-space. In particular if (2.6) is a consequence of Eq. (2.5) then 


($9? = —(hY = —(2)? = —- (fy = I (2.7a) 
and 
$:3, P; ifizj. (2.7b) 


A standard set of matrices which satisfies these relations is 


19.07. 0 OO og 
| [01 0 0 EM CT E: 
LTEM EE ME ICM E |. 
G6 oid 10 0 
bc <0: Dhu Uis go 0 
(er Duda 0 0 0 
Ms gc ede gle PE rie n 
pt de 0-1 00 


An alternate representation can be obtained by multiplying each matrix 
91, 92, and $4 of Eq. (1.2) by —i and then relabeling the resulting matrices 
by ?,, fa, and 73. To complete the set, we can let 


00 0 i 
; 00 -i 0 
EFRA 
-i 0 00 


Recognizing the relationship between the Dirac matrices of Euclidean 
3-space and the three spatial Dirac matrices of the 4-dimensional Minkowski 
space-time geometry it is not too difficult to determine the form of the spatial 
rotation operators. Whereas in Euclidean 3-space 


0 . 8 
R — I cos j^ sins (n1$54 + n?54, + 15$,,), 
the same rotation operator in the 4-dimensional Minkowski space becomes 


8... 0, iu i TOUT T 
Oe LSU co (n 22)625) + 762.) 62.) + 16820622) 
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or 


0 0 
R — I cos T + sin 5 ("Pas + n2$4, + n°9,2). (2.9) 


In view of the result of Problem 1.4, it is possible to write this same 
operator in the form 


R= exo( 4 2) (2.10) 


where A = n5, +1793, + 1°91. 
In Minkowski 4-space, one can also consider space-time rotations or 


boosts. For example, a rotation in the x't-plane can be represented by the 
operator 


B= ew A, (2.11) 
It can be shown that an alternate representation for 2 is 
R= Leos? + jo sinh f. (2.12) 


(See Problem 2.1.) 
The inverse of @ may be computed by reversing the sign of $ or 
replacing ?,9 by ?9,. Suppose 
s = foct + 31x! + fax? + fax? 
and 


S = 8528 |. (2.13) 


Sorting out the separate components of s, we have 


ct = ct cosh $ + x! sinh $, (2.142) 
Xl = x! cosh $ + ct sinh ¢, (2.14b) 
X? = x?, (2.14c) 
B= x3, (2.14d) 


This transformation applies not just to points on the expanding sphere 
of the wavefront but to any point. The coordinates of the open window of 
the spaceship in the spaceship system are (t, x!, x?, x?) = (t, 0, 0, 0). The same 
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point in the earth-bound system is (t, x!, x?, x?) = (t, x1, 0, 0). But in the 

earth-bound system, the spaceship is observed to have a velocity v along the 
Pm 

x'-axis so 


x! = of. (2.15) 


Using these relations in Eqs. (2.14a,b) allows us to interpret the values of 
cosh $ and sinh $. In particular ct = ct cosh $ so x!, which equals ct sinh @, 
must also equal ct(sinh $/cosh 4). 

Thus from Eq. (2.15), 


sinh $ 
cosh $ 


vec 





or sinh $ = ° cosh $. (2.16) 
c 


Since a general property of these hyperbolic functions is 


cosh? $ — sinh? $ = 1, 
it follows that 

cosh? 9[1 — (v/c)?] = 1 
and 


cosh $ = ptem (2.17) 


JA — (v/c)? 


From Eqs. (2.16) and (2.17), we also have 


ee (2.18) 
JA — (o/c)? 
Using these relations, we have 
1 
qi UU (2.192) 
J/1 — (v/c)? 
xl + vt 
xix —— rue (2.19b) 
J1 = (o/c)? 
x? = x?, (2.19c) 
=x, (2.19d) 


For an everyday mundane velocity v for which v «c, this system of 
equations reduces to the Gallilean transformation of Eqs. (2.3a—d). 
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Perhaps the biggest intellectual leap made by Einstein when he intro- 
duced his special theory of relativity was the idea that the elapsed time 
between two events will be different for different observers. 

A result of this aspect of the special theory of relativity is that two 
velocities must be added in a somewhat peculiar way. 

To carry out the composition of two boosts in the same direction it is 
useful to use the exponential representation for a boost operator: 


B(d) = I cosh d + Pio sinh f = exp( 7 J (2.20) 


If 22($,) represents the transformation from the coordinate system of 
the first observer to the coordinate system of a second observer and &(,) 
represents the transformation from the system of the second observer to the 
system of the third observer, then B(¢,)A(¢,) represents the transformation 
from the first observer to the third observer. However, from the exponential 
representation, it is clear that B(¢,)B(d,) = Bd, + ,). From Eq. (2.16), 
it follows that 


v, _ sinh ¢, 


= tanh 2.21 
c cosh ¢, anes ete) 





where v, is the velocity of the first observer with respect to the second 
observer along the x-axis. In a similar fashion 


"2 Z tanh(,), (2.21b) 
C 
and 


“= tanh($; + $i). (2.21c) 
C 


where v, is the velocity of the second observer with respect to the third 
observer and v is the velocity of the first observer with respect to the third 
Observer. From the result of Problem 2.2, it follows that 


v. cosh(ó; + $2) sinh $, cosh $; + cosh $, sinh 9; 
c  cosh(ó, +) cosh $, cosh $; + sinh $, sinh $,- 





If we now divide both the numerator and the denominator of this last fraction 
by the product cosh $, cosh $», we get 


v tanh $, + tanh ¢, 


c 1-tanh $, tanh ¢, 
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That is 
v v/c t v5/c 
c 1+ (v40,/c7) 
or 
p= a Ve M (2.22) 


1+4 (v,v4/c?) 


In the world of Newtonian physics v = v, + v; and indeed in the physics 
of Einstein this is nearly true if |v,v,| « c?. A consequence of Eq. (2.22) is 
that one cannot attain speeds faster than light by adding two velocities less 
than the speed of light. 

Suppose for example an observer on earth sees a glass spaceship pass 
by at a speed of 2 the speed of light. Onboard the spaceship a passenger at 
the rear of the passenger compartment throws a frisbee forward at two-thirds 
the speed of light. According to Newtonian physics, the frisbee would be 
moving at the rate of + the speed of light with respect to the observer on 
earth. However, according to the theory of special relativity, this speed 
would be 


2c $c 12 
v= =— —— € 
1-8? 13 
For another example, suppose that the passenger turned on a flashlight. 
Onboard the spaceship the speed of the light signal would be measured as 
c. If the speed of the spaceship were v, then the speed of the light signal 
measured by the earth-bound observer would be 


v+c 
=c! 





U 
1+- 
Ç. 


Another interesting phenomenon that emerges from the special theory 
of relativity is that of Lorentz contraction. An object moving with uniform 
velocity v has its linear dimension in the direction of motion shortened by 
the factor of ./1 — (v/c)?. The dimensions perpendicular to the direction of 
motion are unaffected by the motion. To obtain this result requires some 
insight into the nature of measurement of length. 

Consider the problem of measuring the length of a rod. Suppose in a 
coordinate frame which is at rest with respect to the rod, the space-time 
coordinates of the two ends at times t, and t, are (t,, 0, 0, 0) and (t5, La» 0, 0). 
Alternatively, we can replace the time coordinate t by ct. These same 
space-time points then become (ct,, 0, 0, 0) and (ct,, L,, 0, 0). This alternate 
representation is somewhat more convenient and because of this convenience 


CLIFFORD ALGEBRA IN MINKOWSKI 4-SPACE 33 


we will use this alternate convention for representing points in Minkowski 
4-space throughout the remainder of this chapter. 

In the frame at rest with respect to the rod, one can determine the 
location of the two ends at possibly separate times t, and t, and then obtain 
the spatial distance L, between the two points. 

On the other hand, suppose we consider a second frame in which the 
rod is moving with velocity v in the x'-direction. According to Eqs. (2.14a, b) 
the space-time locations of the two ends in this frame are 


(ct, cosh $, ct, sinh $, 0, 0) 
and 


(ct, cosh $ + L, sinh $, L, cosh $ + ct; sinh $, 0, 0). 


In this second frame, the distance between the ends of the rod is a meaningful 
quantity only when the locations of the two ends are measured at the same 
time. Suppose we determine the location of the two ends at time 0 as 
measured in the second frame. Then for the first point t, — 0, and the 
space-time coordinates for that point are (0, 0, 0, 0). For the second point, 
the situation is more complicated. For the second point, ct, cosh $ + 
L, sinh $ = 0. Using this equation to eliminate t; from the non-zero spatial 
coordinate gives us the expression (0, L, cosh $ — L, sinh? $/cosh @, 0, 0) 
for the second point. Thus the spatial distance between the two ends is 


L = L,(cosh* $ — sinh? $)/cosh $ = L,/cosh $, 


L=L, J E C). (2.23) 
€ 


One aspect of special relativity which makes it fun is the many 
“paradoxes” that seem to arise. One “paradox” that arises from the Lorentz 
contraction is as follows: suppose we consider a rod which is somewhat 
longer than a vertical slot in a wall. According to the special theory of 
relativity, we should be able to give the rod sufficient speed in the vertical 
direction so as to shorten the rod down to that of the slot. If we then also 
added a small horizontal component to the velocity, the rod would pass 
through the slot. (See Figs. 2.1a, b.) 

So far so good. But what happens in the rest frame of the rod? In the 
rest frame of the rod, the rod has its full rest frame length and the length of 
the slot is shorter. How would it then be possible for the rod to pass through 
the slot? 

To understand the situation in the rest frame of the rod, compare Figs. 
22a, b. For purposes of illustration, I have constructed the picture for the 
case when the relative velocity of the rod and wall is such that the Lorentz 
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Fig. 2.1. (a) When the rod and the wall are both at rest, the rod ıs longer than the slot. 
(b) When the rod is given a high velocity ın a direction parallel to the wall with a small sideways 
drift, the rod shortens and passes through the slot in the stationary wall. What happens in the 
rest frame of the rod where the length of the slot shrinks and the length of the rod remains the 


same? 


contraction factor is 4. I have identified the direction of motion with the 
xl-axis. In this case the distance from any point on the wall to the x?-axis 
in Fig. 22b is half the corresponding distance in Fig. 2.2a. Furthermore, the 
distance of any point on the rod to the x?-axis in Fig. 2.2b is twice the 
corresponding distance in Fig. 2.2a. What becomes clear when the two figures 
are compared is that what is parallel in one frame may not be parallel in 
another frame and events which are simultaneous in one frame may not be 
simultaneous in another frame. In the rest frame of the wall, the event of the 
top end of the rod passing the top end of the slot is simultaneous with the 
event of the bottom end of the rod passing the bottom end of the slot. These 
two events are not simultaneous in the rest frame of the rod. 


Problem 2.1. Show that exp(?,9¢) = I cosh @ + fio sinh ¢. Hint: write out 
the Taylor's series for exp(),)¢) and then separate the odd and even powers 
of ¢ into separate sums. Compare the two sums with the Taylor's series for 


cosh $ and sinh ¢. 


Problem 2.2. Use the relation that exp(f199) = I cosh $ + #1, sinh $ to 
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Fig. 2.2. (a) The change of lengths occurs in the direction of motion—that is in the direction 
of the x’-axis. In the rest frame of the wall, the event of the top tip of the rod passing through 
the plane of the wall is simultaneous with the event of the bottom tip of the rod passing through 
the same plane. (b) Events which are simultaneous in the rest frame of the wall are not 
simultaneous ın the rest frame of the rod. 


show that 


cosh(¢, + $5) = cosh ¢, cosh $, + sinh $, sinh ¢, 
and 
sinh($, + $2) = sinh $, cosh $, + cosh $, sinh $;. 


Hint: use the relation that 
exp Fio(G1 + b2) = exp Pros exp 71095. 


Problem 2.3. Show that the transformation represented by Eqs. (2.14a—d) 
does indeed preserve the length of a vector as defined in Minkowski 4-space. 


Problem 2.4. Suppose 0 < v, < c, 0 < v, < c, and 


Uv, + 02 
v ZU I 
1 + (o,v2/c^) 


Show v < c. Hint: use the relation (c — v4)(c — v5) > 0. 
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Problem 2.5. Show that in general the product of two boosts is not a boost. 
(To show this, choose two boosts with different directions. Comment: in 
general it is possible to show that the product of any number of boosts and 
rotations can be written as a product of a single boost and a single rotation. 
A constructive proof of this appears in Appendix A.1. 


22 Mass, Energy, and Momentum 


A basic law of physics is that of the conservation of momentum. This law 
has a somewhat different flavor in the theory of special relativity than in 
Newtonian physics. 

In Newtonian physics, the directed distance between two points in space 
can be represented by the 3-vector (Ax, Ay, Az). Thus the entities Ax, Ay, 
and Az transform as components of a 3-vector under a change of coordinates. 
Since a time interval in Newtonian physics is considered to be a scalar, 
Ax/At, Ay/At, and Az/At can be treated as the components of a legitimate 
3-vector (Ax/At, Ay/At, Az/At) which equals the velocity vector v or 
(Ux Vy, Vz). 

In Minkowski 4-space, the directed “distance” between two events is 
represented by the 4-vector (cAt, Ax, Ay, Az). In special relativity, At is no 
longer a scalar and aside from a factor of c, Ax/At, Ay/At, and Az/At are 
ratios of different components of a 4-vector and are no longer components 
of a legitimate vector. To obtain something corresponding to a legitimate 
velocity vector, we must divide (cAt, Ax, Ay, Az) by a genuine scalar. The 
most natural scalar for our purposes is the square root of the “length” of 
the vector. In particular, if 


As = $ocAt + f Ax + $5Ay + f3Az, 
then 
(As, ASY = c?(At)* — [(Ax)? + (Ay)? + (Az)?] 
= c'(At*[1 — (v/c)^], 


or As = c(At)[1 — (v/c)? ]*. The scalar As is known as the proper time interval. 
Thus the special relativistic version of the velocity vector is 





( 1 Us/c v/c v,/C ) 
u = F 1^ 1:5 4 
[1 — (v/c)?]* [1 — eA [t — e] [1 — AT 
= (u°, ut, u?, u’). (2.24) 


The 4-vector u is known as the world velocity. Thus it should not be surprising 
to the reader to learn that Einstein's version of the momentum vector is the 
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4-momentum 
(mu?, mu’, mu?, mu?) = (mu?, p! [c, p^ Jc, p? [c). (225) 


It has been found that under any known physical interaction the sum 
of the 4-momenta for the interacting particles before the interaction is the 
same as the sum of the 4-momenta after the interaction. In highly symmetric 
situations, one can show conservation of the last three components of 
the 4-momentum is exactly what one would expect from one's knowledge 
of Newtonian physics and time dilation in special relativity. (See Problem 
2.6.) 

The faith in this conservation law is so great that if experimentalists do 
encounter an apparent discrepancy in their data, physicists will hypothesize 
some previously undetected particle to account for the discrepancy. The 
neutrino was discovered in just this manner. Wolfgang Pauli (1934) postu- 
lated a massless, chargeless particle to account for a 4-momentum dis- 
crepancy in beta decay. (There was also a discrepancy in the total spin in 
the decay process.) Since the postulated particle was both massless and 
chargeless, it was understood that it would be very difficult to detect. 
Computations showed that, on the average, a neutrino would penetrate 
3,500 light years of solid lead before interacting with matter. Nonetheless, 
physicists had enough faith in the conservation of 4-momentum to go to 
extraordinary lengths to detect the neutrino. Eventually, two American 
physicists Clyde Cowan and Frederick Reines (1953) set up huge detecting 
tanks near a nuclear-fission reactor at Savannah River, South Carolina. 
Using some very resourceful techniques, they were able to detect the elusive 
neutrino. (Technically what they detected was the anti-neutrino. Nonetheless, 
the conservation of 4-momentum was reaffirmed.) 

Let us reexamine the last three components of the 4-momentum 
vector defined in Eq. (2.25). We immediately see that one can get the 
special relativistic version of the conservation of momentum by replacing 
y = (Vy, Vy, v,) by (ut, u?, u°). However, physicists do not directly measure 
proper time with their clocks. Therefore it is more natural for most people 
to measure or compute (v,, Vy, v,) than (u!, u?, u^). Thus in the early years 
of special relativity it made more sense for most physicists to obtain the last 
three components of the 4-momentum by replacing the mass m in the 
expression mv by a velocity-dependent inertial mass m, = m/[1 — (v/cY^]*. 
However, in recent years most physicists (at least those in elementary particle 
theory) have found that their computations are generally cleaner if they carry 
out their computations in terms of the constant m (the rest mass) and 
the world velocity. This is also more in the spirit of the theory of special 
relativity. 

Till now, I have ignored the first component of the 4-momentum vector. 
If we expand mu? in a power series of (v/c)?, we get mu? = m + 4m(v/c)* + 
higher powers of (v/c)?. Of course mw? is classically interpreted as the kinetic 
energy of a particle. Also in Newtonian physics one can always add a 
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constant to the potential energy of a particle. Thus it is not totally outrageous 
to interpret mu? = E/c* where E is the energy of the particle. With this 
identification, we have 





( m mosje mv,/c mv,/c ) 
(1 — (o/c)))* A — AAE A — (v/c))*" (1 — (o/c)?* 
= (E/c?, p*/c, p’/c, p*/c). 


Computing the “length” of this vector, we have 


(m)? = (E/c^?Y — (p/c)? 
or 


E? = m?c* + p?c?. (2.26) 


This last equation is considered to be valid even for massless particles such 
as photons and neutrinos where m = 0. In the special case where v = 0, one 
gets the well-known equation E = mc?. 

If you wish to deal with complex computations involving multiple boosts 
and rotations, you will find some useful theorems and computational 
techniques discussed in Appendices A.1 and A.2. 


Problem 2.6. (This problem is adapted from the text Physics of the Atom by 
Wehr, Richards, and Adair (1984, pp. 161-165).) Suppose we hand out two 
idealized basketballs that are perfectly elastic. One ball (ball 2) is given to 
an observer on a speeding train. The other ball (ball 1) is given to an observer 
who is placed alongside the track. As the train nears the observer on the 
ground, each observer is instructed to throw his or her ball with the same 
velocity in a direction perpendicular to the track in such a way that the 
basket balls bounce off one another just as the two observers pass one 
another. 

Each observer sees his or her ball behave as if it hit and bounced off a 
perfectly elastic wall half-way between the two observers. (See Figs. 2.3a, b.) 

According to the instructions given to the observers, the 4-vectors 
representing space-time intervals of each ball in the coordinate system of the 
tosser are: 


e ball 1 in the ground (G) system: (cAt, 0, Ay, 0) for the outgoing leg 
and (cAt, 0, — Ay, 0) for the return leg; 

* ball 2 in the train (T) system: (cAt, 0, —Ay, 0) for the outgoing leg 
and (cAt, 0, Ay, O) for the return leg. 


1. Use the boost operator Z($) = I cosh($/2) + ?,9 sinh($/2) to com- 
pute the 4-vectors for the space-time intervals of ball 2 in the G 
system. 
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Fig. 2.3. Perfectly elastic collision of two identical basketballs as viewed from systems in relative 
motion. M. Russell Wehr, James A. Richards, Jr., and Thomas W. Adair III, Physics of the 
Atom, Fourth Edition (p. 180), © 1984 by Addison-Wesley Publishing Company, Inc. Reprinted 
by permission of the publisher. 
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2. Use the result of part 1 to show that observer G observes the 
y-component of the velocity for ball 2 to be (Ay/Ad[1 — (v/c)?]*. 
Suppose observer G has the prejudices of a physicist trained in 
Newtonian physics. Explain why observer G concludes that ball 2 
has a greater mass than ball 1. 

3. Show that the sum of the 4-momentum vectors for the two basketballs 
in the G coordinate system is conserved. (Note: for ball 2 in the G 
system 


(v2)? = (vy)? + (Ay/A)* (4 — (vy /c)”) 


where vr is the speed of the train. For ball 1 in the G system 
(v1)? = (Ay/At".) 


Problem 2.7. Consider a perfectly inelastic collision of two objects of equal 
mass m which collide head-on with velocities of opposite directions and equal 
magnitude and then merge into a single mass. Use the conservation of 
4-momentum to show that the rest mass of the unified object after the 
collision is 2m/[1 — (v/c)?]*. (Presumably the excess mass for a macroscopic 
collision of two clumps of clay would be lost shortly after the collision in 
the form of heat.) 


3 


CLIFFORD ALGEBRA FOR FLAT 
n-DIMENSIONAL SPACES 


3.4 Clifford Numbers in n-Dimensional Euclidean or Pseudo-Euclidean 
Spaces 


In Chapter 1, we dealt with Clifford numbers in Euclidean 3-space. In 
Chapter 2, we dealt with Clifford numbers in Minkowski 4-space. This 
chapter is devoted to showing how these notions can be extended to "flat" 
spaces of any finite dimension. 

Dirac matrices were introduced by P. A. M. Dirac to formulate his 
equation for the electron (1928). The matrices introduced by Dirac were 
specifically designed for Minkowski 4-space. However it is possible to define 
a generalized version for spaces of arbitrary dimension. This was done by 
W. K. Clifford over 100 years ago (1878 and 1882). 

It is useful to designate a set of n square matrices (51. $5, 95. .... Pny as 
an n-dimensional orthonormal Euclidean system of Dirac matrices if the 
matrices have the following properties: 


1. P,P + 34$, = 26,1, where Iis the identity matrix, ô = 1 if j = k, and 
6, — O if j #k. 

2. By taking all possible products of the n Dirac matrices, one can form 
a set of 2" linearly independent matrices. (These products may be 
written in the form M,M,...M, where M, = $, or I.) 


It should be noted that one cannot obtain any additional matrices by 
using the same fẹ more than once in a product Since 7;), = — frf, 
for j #k, it is clear that any finite product can be rewritten in the form 
cT ($,(04)* ... Oa). Furthermore since (7,,)?* = I, we can replace ($,,)" 
by J if km is even or by Îm if km is odd. 

One might ask, "Is it always possible to construct a set of matrices with 
properties (1) and (2)?" As you might suspect, it is always possible to 
construct a set of matrices with the desired properties. A method of 
construction by Kronecker products is discussed in the first section of 
Chapter 11. 
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Any linear combination of Dirac matrices is referred to as a tangent 
vector or I-vector. 
Any linear combination of products of Dirac matrices of the form 


kik2 .kp^ A A 
A Pe Yk ++ Vip 
ki <k2< <Kp 


is referred to as a p-tangent vector or more simply as a p-vector. A linear 
combination of p-vectors is a Clifford number. 

For the first seven chapters of this book, we consider only Clifford 
numbers for which every coefficient A*1*2 --*? is real. In this case the Clifford 
numbers are said to be real even though the Dirac matrices may have 
complex entries. In Chapter 7, which is devoted to a discussion of Dirac’s 
equation for the electron, we also consider Clifford numbers for which the 
coefficients are complex. In that situation, the Clifford numbers are said to 
be complex. 

An algebra consists of a vector space V over a field F (usually the set 
of real numbers R or the set of complex numbers C) with a binary operation 
of multiplication such that for all x e F and A, B, De V: 


(1) (0.97) 28 = AUB) = o( AB); 
(2) (4 + 289) = AD + 282; 
(3) D(A + B) = BA +BB. 


If, in addition, we have 
(4) (4B)D = A(BD), the algebra is said to be an associative algebra. 


Clearly the set of Clifford numbers identified with an n-dimensional 
Euclidean space forms a vector space of dimension 2" which is closed under 
matrix muitiplication. Thus the set of Clifford numbers forms an associative 
algebra. If you are genuinely surprised to learn that this algebra is called a 
Clifford algebra, your ability to anticipate the obvious must be suspect. 

If the numbers of a Clifford algebra are restricted to real Clifford 
numbers, the algebra is said to be a real Clifford algebra. 1f the members of 
a Clifford algebra are allowed to be complex, then that algebra is said to be 
a complex Clifford algebra. (That should not surprise you either.) 

We now turn to pseudo-Euclidean spaces. What is a pseudo-Euclidean 
space? What distinguishes a pseudo-Euclidean space from a Euclidean space 
is the fact that the scalar product of a non-zero vector with itself is not 
necessarily positive for a vector in a pseudo-Euclidean space. On the other 
hand, it is always possible to find an orthonormal basis {e}, €23, ..., e,) fora 
pseudo-Euclidean space with the property that 


<e, €,» = +1 fork =1,2,3,...,n, (3.1) 
and 


Cene —0 forj sk. (3.2) 
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An example of a pseudo-Euclidean space is the Minkowski 4-space 
which was discussed in Chapter 2. One important feature of pseudo- 
Euclidean spaces is the signature matrix nj, where 


ny = (ej, e. (3.3) 


From Eqs. (3.1) and (3.2), it is clear that ny, is a diagonal matrix whose 
diagonal elements are each + 1. From a theorem proven by Sylvester (Cartan 
1966, pp. 5-6), it is known that regardless of what orthonormal basis is used 
to span a given pseudo-Euclidean space, the number of positive entries and 
the number of negative entries on the diagonal of the signature matrix are 
each invariant integers. For example, regardless of the orthonormal basis 
chosen to span Minkowski 4-space, one diagonal element of the signature 
matrix will be +1 and the other three diagonal elements will be — 1. 

A Clifford algebra for a pseudo-Euclidean space is generated from a set 
of Dirac matrices in much the same fashion as it is done for a Euclidean 
space. We designate a set of n matrices {},,)2,...,,} as an orthonormal 
pseudo-Euclidean system of Dirac matrices if the members have the following 
properties: 


(D) 2,9 + $43, = 2nd; (3.4) 
(2) by taking all possible products of the n Dirac matrices, one can form 
a set of 2" linearly independent matrices. 


To construct an n-dimensional system of orthonormal pseudo-Euclidean 
Dirac matrices, one simply selects an appropriate subset of an n-dimensional 
orthonormal Euclidean system of Dirac matrices and multiplies each If them 
by i, that is by wid. 

There are two comments that should be made here. One is that some 
mathematicians and physicists (mostly English) use the convention that 
P, fr + 9.9; = —2nyJ. In particular, this convention is used by Ian R. 
Porteous (1981, pp. 240-241) and by Roger Penrose and Wolfgang Rindler 
(1984, p. 124). Also many physicists use a reverse metric for Minkowski 
4-space. That is, they use the signature (—, +, +, +) instead of the signature 
(bL 

The second comment is that in his pioneering work on Clifford algebra, 
Marcel Riesz stated and “proved” a theorem to the effect that property (2) 
in our definition is implied by property (1) (1958, pp. 10—12). It is now 
recognized that this theorem is false (Porteous 1981, p. 243). Nonetheless, 
the method of the proof by Marcel Riesz can be used to prove a corrected 
version. Restated in a corrected form, the theorem states that if the product 
$195... În Is not a scalar multiple of the identity matrix I, then property (2) 
is implied by property (1). This is proven in Section 10.1 of Chapter 11. 

When n is even, the product 7,25... f, anticommutes with 7, and thus 
cannot be a scalar multiple of I. However, for odd values of n and certain 
signatures, it is possible to have a system of n orthonormal matrices such 
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that their products span a space of only 2" ! dimensions instead of 2" 
dimensions. 

In his text Topological Geometry, Ian Porteous refers to a system of 
Clifford numbers generated by a system of n Dirac matrices satisfying 
property (1) but not necessarily (2) as a Clifford algebra (1981, p. 240). He 
then refers to a system of Clifford numbers generated by a system of n Dirac 
matrices satisfying both properties (1) and (2) as a universal Clifford algebra 
(1981, p. 240). Except in Chapter 11, the only Clifford algebras discussed in 
this book are what Porteous calls universal Clifford algebras. 

I am not convinced that so-called nonuniversal Clifford algebras are 
worthy of serious study. There are two reasons for this. For those interested 
in the geometric aspect of Clifford algebras, the geometric interpretation of 
p-vectors becomes nonsense for nonuniversal Clifford algebras. For those 
interested in the algebraic aspect of Clifford algebras, it should be observed 
that any nonuniversal Clifford algebra stemming from a Euclidean or 
pseudo-Euclidean space of n dimensions is isomorphic to some universal 
Clifford algebra stemming from a Euclidean or pseudo-Euclidean space of 
n — 1 dimensions. This is a point that should not be ignored by the serious 
physicist. 

In treatments of Dirac's equation, the product + 75,7293 or cijofij2j4 
is frequently labeled as y,. In this context, it is understood that y; is a 
notational shorthand for a product of Dirac matrices. However, some 
physicists have pushed this labeling beyond such a notational shorthand. In 
a partially successful attempt to incorporate gravitational and electro- 
magnetic fields into a single metric, Theodor Kaluza (1921), introduced a 
fifth dimension. With that thought in mind, some physicists have attempted 
to modify Dirac's equation for the electron by using one of the y;'s defined 
above. However, the resulting complex Clifford algebra is identical to the 
complex Clifford algebra associated with Minkowski 4-space. 

I don’t know enough about these theories to say with complete 
confidence that this use of y4 is wrong. After all, the fifth dimension in 
Kaluza's theory is not quite on a par with the other four dimensions. 
However, the physicist using y, in this manner should be aware of the option 
of using a universal Clifford algebra associated with a 5-dimensional 
pseudo-Euclidean vector space. 


Problem 3.1. Suppose $, = o, for k = 1, 2, and 3. Explain why this set of 
Dirac matrices can be used to generate the real universal Clifford algebra 
for Euclidean 3-space but not the complex universal Clifford algebra 
for Euclidean 3-space. (For definition of the three Pauli matrices, see 
Problem 1.2.) 


32 Dirac Matrices in Real Euclidean or Pseudo-Euclidean Spaces 


In an n-dimensional real Euclidean space or pseudo-Euclidean space, one 
can represent any point in space by a linear combination of orthonormal 


CLIFFORD ALGEBRA FOR FLAT n-DIMENSIONAL SPACES 45 
Dirac matrices which we will refer to as a position vector: 
S= 9x": (3.5) 


Note: unless specifically indicated otherwise, we will use the standard 
Einstein convention in which a repeated index appearing in both upper and 
lower positions indicates a summation. In the expression above: 


n 
s= SN 
a=1 


Corresponding to any given coordinate system (ul, u?,...,u"} there is 
a corresponding system of n linearly independent Dirac matrices which will 
be referred to as a coordinate system of Dirac matrices: 


a ax? 
has hos) fora — 1,2,...,n. (3.6) 


To indicate a coordinate system, we will use Greek indices and omit the caps 
on the Dirac matrices. If the system is not necessarily a coordinate system, 
we will use Latin indices. For those situations where we specifically wish to 
indicate an orthonormal system of Dirac matrices, we will continue to use 
caps on the Dirac matrices. 

A useful example of a coordinate system in Euclidean 3-space other than 
the usual {x, y, z} system is the spherical coordinate system. For this system, 
we have 


s = fax + $3y + 932 = fir cos sin 0 + f,r sin $ sin 0 + far cos 0. 
From Eq. (3.6), it is clear that the coordinate Dirac matrices are 


os 


Ria COS ¢ sin 0 + 7, sin $ sin 6 + $4 cos 0, (3.7) 
r 
ôs nag T 
ye = E fir cos $ cos 0 + Îr sin $ cos 0 — ĵar sin 0 (3.8) 
and 
ôs ; ; ; 
y S — jir sin $ sin 0 + $;r cos ¢ sin 0. (3.9) 


For any kind of reasonable coordinate system, the system of matrices 
generated by Eq. (3.6) can be used as a basis for an n-dimensional vector 
space at most points. However, at some points this may not be true. For 
example in the case of the spherical coordinate system y, = 0 for any point 
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x 


Fig. 3.1. The direction and the length of the spherical coordinate Dirac matrices depend on 
the particular point with which they are associated. 


on the z-axis. Thus the spherical coordinate system of Dirac matrices does 
not span a 3-dimensional space at those particular points. 

Usually both the magnitude and direction of the y,’s will be dependent 
on the spatial coordinates and therefore a given set of y,’s will be identified 
with a particular point in space. This is clearly true in the spherical 
coordinate system just discussed. The y,’s take on meaning only when they 
are evaluated at a particular point in space. 

Physicists frequently encounter vectors which are functions of their 
location. One obvious example is the vector used to represent an electric 
field. For this reason it might be natural to refer to the y;'s as “field vectors." 
However geometers have adopted an alternate terminology. Following their 
terminology, the vector space spanned by the ys identified with a given 
point is called the tangent space at that point. A vector in that space is 
described as a tangent vector. 

To get a physical insight into these notions, it may be useful to refer to 
Fig. 3.1. Point P is defined by the position vector s. Furthermore the three 
tangent vectors 7,, yy, and 7, span a 3-dimensional tangent space at the 
point P. 

At first thought these "tangent vectors" may not appear to be tangent 
to anything. However, at the point (ro, 05, Go), Yọ and yy are tangent to the 
sphere r = ry. The Dirac matrices y, and y, are tangent to the cone 0 = 0,. 
And finally y, and y, are tangent to the plane $ = dp. 

For the spaces discussed so far, the coordinate basis is an orthogonal 
basis. However this is not true in general. This will be discussed further in 
Chapter 5. 


Problem 3.2. For the 3-dimensional Euclidean space, compute the system of 
Dirac matrices corresponding to the cylindrical coordinate system {p, 0, z} 
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where in terms of the usual Cartesian coordinates (x, y,z], we have 
x = pcos 0, y = p sin 0, and z = zZ. 


Problem 3.3. Suppose a = y,A” and b = y, B*. Show that the symmetric form 
ab + ba is a scalar (a 0-vector). Hint: use Eq. (3.6) along with Eq. (3.4). 


3.3 The Metric Tensor and the Scalar Product for 1-Vectors 


An arbitrary vector a can be written in the form a = A/y,. The result of 
Problem 3.3 makes it possible to define a scalar product (a, b> of two vectors 
a and b. We note that since ab + ba is a O-vector, we can define <a, b> as 
the coefficient of J that occurs in the following equation: 


l(ab + ba) = <a, b» I = (b, a). (3.10) 
Two symmetric matrices arise from this definition of the scalar product 


of two 1-vectors. One is the signature matrix already encountered in Section 
3.1 which is related to an orthonormal basis as follows: 


ny = My = Qu (3.11) 


The second symmetric matrix is known as the metric tensor g,,. This matrix 
arises from a given coordinate basis of Dirac matrices. In particular: 


Gap = Ipa =. us y8?- (3.12) 
For the flat Euclidean or pseudo-Euclidean spaces discussed so far, the 


transformation used to obtain the coordinate system of Dirac matrices from 
the signature matrix is a simple matter. In particular since 


Ya Yy ĝu” > 
Eq. (3.12) can be rewritten as 
o. fox M Oxy 
gag = Qs Pe? (EES) 


0x! ôx" 
"Ht Out Ou 





or 


Gap = (3.13) 


The inverse of the matrix g,, is designated by g”. Since Jag 1S symmetric 
with respect to its two lower indices, the inverse matrix is also symmetric 
with respect to its two upper indices. That is, g^? = g**. 

This inverse matrix can be used to construct a biorthogonal basis 
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for the space of tangent vectors at any given point. This alternate basis 
(y, y?,...,y"} is designated by indices in the upper position and is defined 
by the relation: 


y*=g®y; fora=1,2,...,n. (3.14) 


We note that (75, 755 = g" Yas yg? = g^", OF 


Q5 yg? = ó5, (3.15) 
where 
1 dfa- 
à; -| im (3.16) 
0 ifazf 


To distinguish the y^'s from the y,'s, we will label the y*’s as upper index 
Dirac matrices and the y,'s as lower index Dirac matrices. 

It is because of Eq. (3.15) that the two bases are said to be biorthogonal. 

The distinction between the two bases is particularly important for 
coordinate systems of Dirac matrices. In that case, the position of the indices 
determines how the Dirac matrices are transformed under a change of 
coordinates. Consider a change of coordinates from {u', u’,..., u"} to 
owes sa. 

In the two coordinate systems: 


ds = y, du” = 7, du. 











Thus 
és Os 
= and jg = —- 
"t Qu* n Qu? 
Since 
Os 2 Qu* Os 
an? ü! ôu” 
it is clear that 
ĝu” 
Fe = — Ya: 3.17 
Tp = aoa) (3.17) 
It also immediately follows that 
Ou" Qu" 





gap = Pus 32? = Vis Vy? 0ü* oa 
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Or 
Qu" Qu" 
Qi o oc 3.18 
Gap av EP EPI ( ) 
By contrast, it will be demonstrated that 
Qu* Gu? 
gj = g" — —. 3.19 
2 Qu" du” ym 


To verify Eq. (3.19), we need to show that the right-hand side of Eq. (3.19) 
is the inverse matrix of ggg. That is, we must show that 
Ou Ou? 
53 = dgoQg"" —— —. 
PT Po Qu" ðu” 


But this is equivalent to the equation: 


: Ou* ðu’ du" Ou? 
a= (ne a)" i ic 





To condense the right-hand side of Eq. (3.20), we note that 


( Qu* s " out Qu? _ " Qu? Qü* s s) 
95 ou) Qu^) 9 Burau 99 a? Qu (ou Ow 


= qug Qu? Out (+) 
= Jég 


0i? du" ou” 














(68) 











Thus we have verified Eq. (3.20) and thereby Eq. (3.19). 


50 CLIFFORD ALGEBRA 
Using the fact that 7% = g"?55, it is not too difficult to show that 


Cu" 
y = yf —. 3.21 
ce dicun (321) 
(See Problem 3.4.) 
In a similar fashion, the reader should not have difficulty demonstrating 
that if 


a= A^, = Ap = A’y, = AY, 


then 
z Qu* 
D = a (3.22) 
ĝu” 
and 
3 Qu 
A, = B og" (3.23) 


It is important to note the position of the indices and the bars in Eqs. 
(3.17), (3.18), (3.19), and (3.21). We observe that when we change the system 
of coordinates, the upper index components A’ of a vector a undergo the 
same transformation as the upper index Dirac matrices. Similarly the lower 
index components A, of a vector a undergo the same transformation as the 
lower index Dirac matrices. 

In the presence of a non-singular metric, the upper index Dirac matrices 
span the same space as the lower index Dirac matrices. In the absence of a 
nonsingular metric one can no longer use Eq. (3.14) to define upper index 
Dirac matrices. 

It is interesting to observe that even in the absence of any metric where 
there is no Clifford algebra some of the mathematical structure can be 
retained—at least in an analogous form. In the absence of a metric, one must 
use the formalism of differential forms. 

It is worthwhile to compare the formalism of differential forms with that 
of Clifford algebra. In the formalism of differential forms, one represents a 
tangent vector as a linear combination of the entities (0/0x^). These entities 
are analogous to a coordinate system of Dirac matrices and undergo the 
same transformation when a change is made in the coordinate system. In 
the formalism of Clifford algebra, one could define the upper index Dirac 
matrices by the requirement that (7^, y,> = ó5. In the formalism of differ- 
ential forms, an analogue of the upper index Dirac matrices is introduced in a 
slightly different way. In place of 7^, one introduces a /-form dx* which is 
defined as a linear mapping from the space of tangent vectors to the set of 
real numbers. In particular 


dx*: 0/üx? > à; so dx*: AP(0/0xP) > A*. (3.24) 
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These 1-forms span an alternate vector space with the same dimension as 
the space of tangent vectors. Rather than being biorthogonal, the 1-forms 
dx* form a basis which is said to be dual to the coordinate basis formed by 
the (0/O0x^ys. Furthermore the space spanned by 1-forms is also said to be 
dual to the space of tangent vectors. 

In the formalism of Clifford algebra there are two natural ways of 
representing the same vector for a given coordinate system: 


a= A*y, = Apy’. 


From the relation yf = g**y,, the reader can show that A* = g” Ap. 

When a non-singular metric is introduced into the formalism of differ- 
ential forms, one has a natural isomorphism between the coordinate basis 
of the space of tangent vectors and the coordinate basis of the space of 
1-forms. Namely: 


dx* e g**(0/dx*). 


Under this isomorphism, the tangent vector A"(0/0x^) is mapped onto the 
1-form A,(dx*) where 


A* LL g Ag. 


In both formalisms, a change in coordinates implies a transformation 
in the components A* and Aș given by Eqs. (3.22) and (3.23). 

To distinguish the way they behave under a change of coordinates, the 
A,’s are described as being components of a covariant tensor of order one. By 
contrast the A*s are described as the components of a contravariant tensor 
of order one. In the same spirit, the components of the metric tensor are 
components of a covariant tensor of order two. 

In the context of Clifford algebra, this terminology is a little strange 
because the A's and the A?s may be components of the same entity. 

Generally we must be sensitive to the order of the indices for the 
components of a covariant (or contravariant tensor). Generally Asg # Aga- 
For this reason, we must be sensitive about the vertical alignment of the 
indices for a mixed tensor. We cannot assume that A,’ = A®,. We note that 
Ing A, = Aay while g,5 AP, = Apa. Thus if Ag,  A,, then A,’ # Af, 

One exceptional case where we do not have to make this distinction is 
the Kronecker delta symbol. Since 6,’ = 6°,, we will save space by simply 
writing ôf. We will also align the indices in a similar manner when we 
encounter a generalized version of the Kronecker delta symbol in the next 
section. 

Another exception also appears in the next section where we use a 
combination of coordinate and noncoordinate indices. 
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Problem 3.4. Use the fact that 7* = g*^7, to verify Eq. (3.21). 
Problem 3.5. Show that if A,y* = A?y,, then 


Af = g" A, (3.25) 
and 
A, = g,5 AP. (3.26) 


Because of Eqs. (3.25) and (3.26), g^? and g,, are sometimes referred to 
respectively as the index raising operator and the index lowering operator. 


Problem 3.6. Prove <y%, yf 5 = g”. 


Problem 3.7. Components of a tensor can be characterized by their trans- 
formation properties. For example A", is a component of a mixed tensor of 
order two if 

, Ou” Ou 


At emt ms 
; Qu? Qu" 





Using that criteria show ó5 is a component of a mixed tensor of order two. 


Problem 3.8. Show that if a = A"y, and b = B"y,, then <a, b> = A'B'g,, = 
A, B,g”? = APB, = A,B*. 


Problem 3.9. Use Eq. (3.13) to show that for spherical coordinates g,, = 1, 
Joo =, Gop = T? sin? 0, and gy, = O if j # k. 


Problem 3.10. Compute the inverse of the metric tensor computed in 
Problem 3.9. Then use the result and Eq. (3.14) to compute y”, y®, and y® in 


terms of 7,, fa, and 73. 


Problem 3.11. Show that for Euclidean coordinates 


ander dy fm ef, | 
Deeg Spee cae (6x4)? (x2)? * (x3)? E 


Problem 3.12. Show that if 








ou" 
y* — B — 
UT 
then 
Ó Fa 
¥— =f 
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Problem 3.13. Show that 


(rie : Ta T 
E ag 
[2422412 EE 
Or? rór r^sin000 00/ r° sin? 0 ad? 


(Use the results from Problem 3.19.) 








Problem 3.14. For cylindrical coordinates x! = p cos 0, x? = p sin 0, and 
x? = z, compute y,, yo, and y, in terms of fı, fa, and 3. Use this result to 
compute the components of the metric tensor and then the components of 
the inverse tensor. Then compute y”, y®, and y7. Finally show that 


ject care] mnes 
1 60 Bel [us op a deut] 


Problem 3.15. An index free Clifford number . is a Clifford number which 
contains no unsummed indices. An example of such a Clifford number is the 
3-vector J = AP yg Ys where A*^, is a component of a tensor. Demon- 
strate that an index free Clifford number transforms under a change of 
coordinates as a scalar, that is, 2 = 49',5,5,3* = A = AP y yy". 








3.4 The Exterior Product for p-Vectors and the Scalar Product for 
Clifford Numbers 


As already indicated in Section 3.1, a product of p distinct orthonormal Dirac 
matrices or a linear combination of such products is referred to as a p-vector. 
However, this does not mean that the product of p distinct 1-vectors is 
necessarily a p-vector. For example, suppose 7,, fa, and ?4 are orthonormal 
with (f1)? = (9; = (93)? = I. Then if 


a= A!j, + A?) A and b= B'j, + B^j; + B43, 
we have 
b = (A+B! + A?B? + A®BS)I + (4? B? — A? B?) PaP 
+ (A®B* — A*B*) $39, + (A'B? — A?B*) 992. 
Thus it is not difficult to see that in general the product of two 1-vectors is 


a linear combination of a 2-vector and a O-vector or scalar. However, a 
2-vector can be constructed by taking a linear combination of ab and ba. In 
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particular 


Mab — ba) = (AB? — AB?) $3}, (B! — AB) 9x9, 
+ (ALB? — A*B)}, 45. 


More generally, suppose we consider p distinct Dirac matrices each one 
of which is expanded in terms of an n-dimensional orthonormal basis; that 
is, y, = Wij, for v = 1,2,..., p. Then 

VaY2 Yp = WEWE... WPI hes Dp (3.27) 
The terms on the right side of Eq. (3.27) are of several types. If };,, 25, .. .. 9), 
are all distinct then their product is a p-vector. On the other hand, if some 
of the 7,'s in a given term are identical then like pairs of the orthonormal 
Dirac matrices may be grouped together and multiplied out until the only 
remaining Dirac matrices in that term are all distinct. Thus we see that the 
right-hand side of Eq. (3.27) consists of a linear combination of p-vectors, 
(p — 2)-vectors, (p — 4)-vectors, and so forth, on down to l1-vectors or 
0-vectors. 

One can project out the p-vector component on the right-hand side of 
Eq. (3.27) by antisymmetrizing the product. To do this explicitly, it is useful 
to introduce the generalized Kronecker delta symbol 5;'';,._'?,. This symbol is 
defined to be +1 if the upper indices are all distinct and the sequence of 
lower indices is an even permutation of the sequence of the upper indices. 
The same symbol is equal to —1 if the upper indices are distinct and the 
sequence of lower indices is an odd permutation of the upper indices. For 
all other cases, the generalized Kronecker delta symbol is zero. 

With this generalized delta function, we can define y,,,,.,, as the 
antisymmetric product of y,,, Yy,» .. -> Yy,- In particular: 


1 
^ = nin " 
P viva Vp (5 one a Yni?us +++ Yup 


1 
= "mu Mp Yi num Wir 3 $ 5 
n ( On vp Fm nas) I mp Inta: Vip 


p! 
We Wi ... WH 
WWE i CAD eL. 
i ben det i . InfacssTugs- 1328) 
P- : : 
We Wi ... Wi 


It should be noted that if y,,, Yv» ..., Yv, are linearly dependent, then 
each determinant on the right-hand side of Eq. (3.28) will be zero and thus 
the antisymmetric 7,,,, v, will be zero. 
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For the special case for which p — n, the same determinant appears on 
the right-hand side of Eq. (3.28) n! times. Thus we have 


Wi Wi... Wi 
312... = det PaPa- Pr (3.29) 


The determinant on the right-hand side of Eq. (3.29) may be considered 
to be the volume of the n-dimensional parallelepiped spanned by the tangent 
vectors Yi, Yz- - -> Yn- 

The ordinary matrix product of a p-vector and a q-vector results in a 
linear combination of vectors of the orders p + q, p + 4 — 2, p + q — 4,..., 
on down to p — q. However, for some purposes, it is useful to drop the lower 
order forms. With this in mind, one defines the exterior product of a p-vector 
and a q-vector as the projection of the ordinary matrix product onto the 
space of vectors of order p + q. In particular 


^ = l _ 6 iB2. pta 
Yviva op Vain eha 0 (p 4 4)! ViV2.+.VpMin2-- nq’ B1B2-»-BpBp+1Bp+2 --Bp+a 
= Verve. -Ypmin2-. Nq (3.30) 


where it is understood that y,,,,.. ,, = 0 if any two indices are identical. 
This can be generalized a little bit. Suppose 


LL 
F = a pEi 


dp 


and 


1 E 
4 = q! gum TOR: -Jq° 


Then 


1 , 
F ^ G = EY”? PGI 


— (3.31) 
p: dq. 


1pJ1J2«.-J47 


It should be noted that Eqs. (3.28), (3.29), and (3.30) are valid for both 
coordinate and noncoordinate systems of Dirac matrices. 

Any matrix product or linear combination of such products can be 
decomposed in a unique way into a linear combination of p-vectors of 
varying orders. Suppose {},,)2,...,%,} is a set of orthonormal Dirac 
matrices. Then 7,2; — 9, if iz j and $;?; =n I if i=j. This can be 
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summarized by the equation: 
$3,734 nl. (3.32) 


(Note: one and only one of the terms on the right-hand side of Eq. (3.32) is 
nonzero.) 

Consider the product 7;),?,. If all three indices are distinct then 
99s? = fig. If the first two indices are identical then $;,), = nij,. If iz j 
and i= k, then 7,7,9, = — $; 3, = —na$,. M iz j, i x k, and j= k, then 
Pf, fk = naf, If we write 


eT = Pik + nD, nd ny, + ni, (3.33) 


we note that this equation is also valid for the case in which i =j = k. 
Using similar arguments: 


VAT = Duk + T n = Tha im T um’ jk 

zh Wim ae Num? ik T Das) F Djs d 

— My Ay l + nang. (3.34) 
To adjust these formulas for a coordinate basis it can be shown that one 
merely replaces the components of the signature matrix by the components 
of the metric tensor. To demonstrate this, we observe that 

Yale? y = Wi Ws W*S.D je 

Wi, W3 WEP s. ou Dk m ny, iE ny] 


ll 


or 


Ya BYv = Vagy + JapYy — Gav¥p + ÜpvYo- (3.35) 


Of course any one of the equations written down with lower indices has 
a corresponding version with upper indices. Corresponding to Eq. (3.28), we 
have 


y? Jp — (=) ó712 eyhyne y" 
p! 1112. .tp 


and corresponding to Eq. (3.35), we have 
Py = yt + gy” — gey + gye. (3.36) 


Since a product of Dirac matrices can be decomposed into a sum of 


CLIFFORD ALGEBRA FOR FLAT n-DIMENSIONAL SPACES 57 


p-vectors, the same can be said for any Clifford number. For example, 
consider the spherical coordinate system applied to Euclidean 3-space. 
Suppose we designate the Dirac matrices by y,, ys, and y,. Then an arbitrary 
member . of the associated Clifford algebra can be written as a linear 
combination of p-vectors with lower indices up to order 3: 


A = AI + AY, + AY + Ay, + A Yog 
+ Ayar + AM yg + A Yog. (3.37) 


Similarly one could write the same Clifford number in terms of p-vectors 
with upper indices: 


A = AI = A,y" + Agy? + Apy? + Aggy 
+ Apy” + Ary? + A). (3.38) 


The set of pure p-vectors can be treated as a vector space. The natural 
basis for such a vector space is formed by constructing antisymmetric 
products of p distinct coordinate Dirac matrices. The dimension of this space 


_f{n\. ; : 

is since that is the number of ways that p objects can be chosen from a 
p 

set of size n. For the example above, the space of 2-vectors has dimension 


3 ; 
B = 3 and is spanned by yog, yg, and 7,9. The same space is also spanned 


by the 2-forms y*9, y?", and y”®. 

By taking the direct sum of the different p-vector spaces, we arrive at 
the vector space of all Clifford numbers. As previously indicated the 
dimension of this larger space must be 


5(”) =F (") 171"? = (1 + 1 = 2". 
p NP p AP 


In Section 3.3, we defined the scalar product for a pair of 1-vectors. We 
now turn to the problem of defining the scalar product of any two Clifford 
numbers. To deal with this, we use two concepts. The first notion is the 
scalar component of a Clifford number. Following the notation used by 
David Hestenes and Garret Sobczyk (1984, p. 3), we designate the scalar 
component of a Clifford number s by (£) For example, from Eq. (3.37) 
or (3.38), we have (£) = A. 

The second notion is that of the reverse of a Clifford number. Continuing 
to follow the notation used by Hestenes and Sobczyk (p. 5) and the 
terminology of Marcel Riesz (1958, p. 13), we define the reverse «7! of a 
Clifford number . to be that Clifford number obtained by reversing the 
order of all products of Dirac matrices in the linear expansion of <£. 
For example if / is the Clifford number that appears in Eq. (3.37), 
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then 


a = AI + AY, + A'Y + AT, + Aygo 
+ AP yg + A" Yor + Arey por 
= AI + Avy, + Ay, + A9y, — A Yog 
AF 4, A”) A o: (3.39) 





We can now define the scalar product of two real Clifford numbers £ and 
28 to be 


LA, BY = (4B). (3.40) 


Problem 3.16. Harley Flanders (1963, p. 14) defines the scalar product of 
two p-forms in a somewhat different manner. The Clifford algebra analogue 
would be 


I RED! aa oy tas 


QS. PS a uu Aem 


(gitz: tp yn d» = det (3.41) 


(etos c0 aM asin: Qe aes 
For a slightly more general case, suppose 


— 182... 4 
A es Aii, à tp? 7 


l 


and 
MB ege A 
then 
Qu BY Aor Buc Qa ce I dp, 


Show that for pure p-vectors, the definition of Hestenes and Flanders agree. 
Suggestion: try using an orthonormal basis. 


Problem 3.17. Demonstrate that (122), = (Bt), where «7 and B are 
arbitrary Clifford numbers that are not necessarily index free. (This shows 
that (oZ, BY = (£8, A X.) 


Problem 3.18. Show that (7B). = (AW), where £ and B are not neces- 
sarily index free Clifford numbers. 
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Problem 3.19. Generalize the result of Problem 3.18 to show that the scalar 
component of a product of several Clifford numbers is invariant under a 
cyclic permutation, that is 
(Ay... Ay) = (yA... A, As) o 
= (A. He... AA, Ay... Ay). (3.42) 


4 


CURVED SPACES EMBEDDED IN HIGHER 
DIMENSIONAL FLAT SPACES 


4.0 Why you may wish to skip Chapter 4 


If you have a cursory knowledge of differential geometry, you may wish to 
skip this chapter. This chapter is devoted to looking at some aspects 
of m-dimensional surfaces embedded in n-dimensional pseudo-Euclidean 
spaces. Much of the chapter is further specialized to 2-dimensional surf- 
aces embedded in E?. We discuss these surfaces to introduce such notions 
as curvature, Christoffel symbols, and parallel transport. Also on these 
surfaces, it is possible to motivate the requirement that a derivative opera- 
tor be torsion free. Nonetheless, if you have encountered these concepts 
elsewhere, you should be able to skip this chapter without loss of con- 
tinuity. 


4.1 Gaussian Curvature and Parallel Transport on Two-Dimensional Surfaces 
in E? 
The simplest example of a curved surface is the ordinary 2-dimensional 
sphere in Euclidean 3-space. Furthermore the geometry of the sphere serves 
as a major motivation for much of the mathematical work that has been 
done for more general surfaces. For this reason it is useful to examine some 
basic results associated with the geometry of a sphere. 

Given two points on a surface, a geodesic is a path of minimum length 
passing through the two points.* For a sphere, a geodesic is a segment of a 
great circle—that is, a circle formed by the intersection of the sphere with 


* Most mathematicians will cringe at this definition with good reason. When one travels by 
plane from New York to London by a great circle route over the Atlantic Ocean, one is following 
a geodesic. However 1f one continues the flight over the same great circle, one is still following 
a geodesic even though the shortest route from London to New York is not over Turkey. 
Furthermore in Minkowski space, a geodesic is a path of maximum length rather than minimum 
length. Nevertheless we will rely on the intuition and common sense of the reader until we have 
developed enough mathematical machinery to give a more sophisticated definition in Section 
4.3 of this chapter. 
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Fig. 4.1. The intersection of two great circles forms a lune. 


a plane passing through the center. The intersection of two great circles forms 


a lune. (See Fig. 4.1.) 


The area of a sphere is 4nr?, where r is the radius. The area of a lune 
is the fraction 0/27 of the sphere. Thus the area of a lune is (0/21)(4nr?) or 


2r?0. From this result, it is not too difficult to determine the area of a triangle 
formed by three great circles. Such a figure is known as a spherical triangle. 


Theorem. (Refer to Fig. 4.2.) The area of AABC = [(A + B + C) — x]r’, 


where A, B, and C are the vertex angles measured in radians. 


Proof. (For shorthand purposes, we will designate the area of a figure simply 





Fig. 4.2. The spherical triangle ABC is formed by three great circles. 
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by the label for the figure.) 


AABC + AA'BC = 2r7A_ (this is the area of lune ACA'BA); 
AABC + AABC' = 27?C; 
A ABC + AAB'C = 2r7B; 

AA'BC' — AAB'C =0 (by symmetry). 


We now observe that AABC + AA'BC + AABC' + AA'BC' equals the 
area of the upper hemisphere in Fig. 4.2 which is 2rr?. Making use of this fact, 
we can add up the two sides of the four equations above and obtain the 
result that 


2^ABC + 2nr? = 2r*(A +B + C). 
From this, it immediately follows that 


Area of AABC = [(A + B + C) - n]r?. (4.1) 
B 


The quantity 1/r? is referred to as the Gaussian curvature of the surface 
and the quantity (A + B + C) — z is called the spherical excess of ^ ABC. 
Thus the product of the area of AABC and the Gaussian curvature is the 
spherical excess. For figures more general than spherical triangles, it is useful 
to express the product of the area and the Gaussian curvature in slightly 
different terms. 

Suppose we consider the problem of transporting a vector tangent to 
the spherical surface around the perimeter of the spherical triangle without 
rotation. When we say "without rotation," we mean without rotation 
detectable to an observer who is required to make all measurements in the 
surface of the sphere. 

In a flat 2-dimensional plane we can move a vector from point A to 
point B by sliding it along a straight line while maintaining a constant angle 
between the vector and the straight line. Such a displacement is called parallel 
transport. (See Fig. 4.3.) 

On the surface of a sphere, the analogue of a straight line is a geodesic 
or great circle. With that in mind, we now try to move a vector around the 
perimeter of a spherical triangle by parallel transport. (Refer to Fig. 4.4.) 
Suppose we consider vector v, tangent to geodesic AB at vertex A. Moving 
the vector by parallel transport along AB, it becomes vector v, at vertex B. 
Continuing the process, the vector becomes v3 at vertex C and vector v, at 
the original location vertex A. This is perhaps most easily visualized for the 
example in which A is the North Pole and BC is a segment of the equator. 
In this case our parallel transported vector will be pointing South at all 
times during the motion. 
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Fig. 4.3. Vector v, is the result of a parallel transport of v, along the path AB. 


Although the vector is moved by parallel transport along all points of 
the perimeter, we discover that when the vector is returned to its original 
location, it has become rotated counterclockwise from its original orienta- 
tion. Referring again to Fig. 4.4, it is not too difficult to convince oneself 
that this angle of rotation is (A + B + C) — x which is exactly the spherical 
excess of the spherical triangle. Thus if « represents the angle rotated by the 
vector parallel transported about the complete perimeter of the spherical 


2». 





Fig. 4.4. The vector v, 1s parallel transported along the great circle arc AB to v;, then along 
the great circle arc BC where it becomes v3. Finally it 1s parallel transported along great circle 
CA where it becomes v4. Careful computation reveals that v, in the guise of y, has undergone 
a rotation of magnitude A + B + C — m. 
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Fig. 4.5. When », is parallel transported around the path OABCAO ıt becomes v,, then v3, 
and then finally v4. The angle between v, and v, is the same as the angle between v; and v4. 


triangle then: 


a = (1/r?)(Area of AABC) 
= (Gaussian curvature of sphere)(Area of A ABC). (4.2) 


This last result can be generalized in several ways. It is possible to show 
that for a fairly broad class of closed paths, the angle of rotation for a vector 
parallel transported around a member of that class is equal to (1/r?) 
multiplied by the area enclosed by that path. Consider Fig. 4.5. Suppose all 
arc segments are great circles and the closed path is OABCAO. Suppose 
vector v, parallel transported along OA becomes v, at A and then v, at A 
after it is also parallel transported around the spherical triangle ABC. Then 
finally suppose v, is parallel transported into v, when it is carried along the 
last leg of our closed path from A to O. It is not difficult to see that the 
angle between v, and v, is the same as the angle between v; and v4. To get 
that result, we merely note that on the last leg of our journey from A to O 
not only does v, parallel transport into v4 but v, parallel transports into v,. 
For this reason the angle of rotation for the closed path OABCAO is the 
same as the angle of rotation for a vector simply parallel transported about 
the perimeter of the spherical triangle. 

It is not too difficult to generalize further. For example, if the region 
encompassed by the closed path can be decomposed into a finite sum of 
spherical triangles then the angle of rotation caused by parallel transport of 
a vector around the closed path will still equal (1/r?) multiplied by the area 
surrounded by the closed path. (See Problems 4.1 and 4.2.) 

One can also define the notion of parallel transport of vectors along 
paths which are not geodesics. If the path is reasonably well behaved then 
one may consider it to be a limit of a sequence of polygonal paths consisting 
of geodesic segments. One could determine the parallel transport of a vector 
on each of the polygonal paths in the sequence and then take some 
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appropriated limit. However, this approach leads to massive computations 
which I would like to avoid. For this reason, we will return to this problem 
with a different approach in Section 4.3 of this chapter. 

Meanwhile, we have created enough machinery to consider the curvature 
of 2-dimensional surfaces in E? which are not necessarily spherical. For a 
sphere, the Gaussian curvature is (1/r?) and is constant. More generally, a 
2-dimensional surface has a Gaussian curvature which varies from point to 
point. One way of defining Gaussian curvature is to make use of the notion 
of parallel transport of a vector in much the same fashion that it has been 
just discussed. If the surface is smooth near a given point x, then we can 
choose two other points y and z such that there exist three geodesic segments 
forming a geodesic triangle with non-zero area. One of the three geodesic 
segments would have end points x and y, another end points y and z, and 
of course the third segment would have end points z and x. If K(x) is used 
to designate the Gaussian curvature at x, then 


K(x) = lim «/(Area of geodesic triangle), (4.3) 


d>0 


where x is one of the vertices of the geodesic triangle, d is the length of the 
longest edge, and « is the angle of rotation for a vector parallel transported 
about the perimeter of the geodesic triangle. 

Let us apply this definition of Gaussian curvature to a couple of 
examples. The most trivial example is a flat surface. Consider a flat sheet of 
paper. On such a sheet of paper geodesics are straight lines which can be 
constructed with a pencil and ruler. In this case a “geodesic triangle" is an 
ordinary triangle. When we parallel transport a vector about such a triangle, 
the result is a zero rotation. Thus by the definition that appears in Eq. (4.3), 
we get a zero value for the Gaussian curvature. This 1s indeed no great 
surprise. 

What is somewhat less obvious is the fact that if the paper is bent without 
stretching, the resulting surface still has a zero value for its Gaussian 
curvature at all points. If for example the paper is bent into a cylinder or a 
cone, the edges of a penciled triangle are no longer straight lines in 3-space. 
Nevertheless they are still geodesics in the surface of the paper. The angles 
at the vertices remain the same and therefore the parallel transport of a 
vector about the triangle would still result in a zero rotation. Using Eq. (4.3), 
we would still get a zero value for the Gaussian curvature. 

An observer who is constrained to take all measurements in the surface 
of the paper would not be able to distinguish between the flat sheet of paper 
and the bent sheet of paper. Perhaps we should insert a condition or proviso 
in this last statement. If the paper were infinite and it was bent into the shape 
of an infinite sine wave, then the statement would indeed be truc. (See Fig. 
4.6.) On the other hand, if the paper were rolled into a cylinder or cone there 
would be some geodesics along which the observer could travel in one 
direction and eventually return to some point encountered previously on the 
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Fig. 4.6. A piece of paper which 1s bent without stretching would be locally indistinguishable 
from a flat surface to an observer constrained to take all measurements in the surface of the 


paper. 


trip. In this situation the observer could safely conclude that we were 
not dealing with a flat sheet of paper. However, if we require the observer 
to take all measurements not only in the surface but also in some sufficiently 
small local region then it would indeed be impossible for the observer to 
determine whether or not the local region was flat or bent in 3-space. 

In this chapter, we have so far considered the sphere which has positive 
Gaussian curvature and some other 2-dimensional surfaces that have 
zero Gaussian curvature. On the other hand there are surfaces with a 
negative Gaussian curvature. (See Fig. 4.7.) 

On the spherical surface of the earth, two meridians which appear 
parallel at the equator converge and intersect at the North and South Poles. 
By contrast, on a surface of negative curvature, geodesics which appear 
parallel at one location diverge from one another. (See Figs. 4.7 and 4.8.) As 
a result triangles formed from geodesics have the property that the sum of 
their angles is less than n. (See Fig. 4.9.) As a result a vector which is parallel 
transported around the perimeter in the counterclockwise direction will be 
rotated in the negative or clockwise direction. 





Fig. 4.7. On a surface of negative curvature, geodesics which appear parallel at one location 
diverge from one another 
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Fig. 4.8. Diverging geodesics from Fig. 4.7. 


P 


Fig. 4.9. A triangle formed from geodesics on a surface of negative curvature. The sum of the 
three angles is less than z. 


Problem 4.1. Consider Fig. 4.10. Suppose ABC and DEF are spherical 
triangles. Convince yourself that the angle of rotation caused by the parallel 
transport of a vector along the path OABCAODEFCBDO is the same as 
the angle of rotation caused by the parallel transport of a vector along path 
OABDEFCAO. 
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Fig. 4.10. On a spherical surface, the amount of rotation caused by parallel transport along 
the path OABCAODEFCBDO is the same as the rotation caused by parallel transport along 


the path OABDEFCAO. In both cases, the amount of rotation is (1/r?) x (area of figure 
bounded by the curve ABDEFCA). 
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Problem 4.2. Consider a closed path on the surface of a sphere which is 
composed of a finite number of segments of geodesics. Suppose also that the 
region encompassed by the closed path may be decomposed into a finite 
sum of spherical triangles. Use the result of Problem 4.1. to convince yourself 
that the angle of rotation caused by the parallel transport of a vector around 
such a closed path is equal to (1/r?) multiplied by the area encompassed by 
the closed path. 


4.2 The Operator V, on an m-Dimensional Surface Embedded in an n- 
Dimensional Flat Space 


Consider an n-dimensional Euclidean or pseudo-Euclidean space. In such a 
space we have a position vector s which can be used to derive all tangent 
vectors 


GSS) NE (4.4) 
k=1 


As in Chapter 3, it is understood that the j,'s are orthonormal. That is 
(9 = -IVk and $,);--$,9,— 0 iff sk. 


Now suppose we consider an alternate coordinate system ut, u*,..., u". 
Then 


x! = x!(u!,u?,..., u") 
x? = x?(u!, u?,...,u") 


(4.5) 
x" = x"(ul, u?,... , u”). 


From this last set of equations, we can obtain the lower index Dirac 
matrices corresponding to the u* coordinate system: 


pth ey: (2n. (4.6) 


Ou* = = NOu* 





(In this section, we will use barred entities when we discuss the full 
n-dimensional space and unbarred entities when we discuss lower dimen- 
sional surfaces. Since the range of summation of the indices will vary, we 
will also add the summation sign in equations where the range of summation 
may not be obvious.) 

The corresponding contravariant Dirac matrices will be determined by 
computing the inverse matrix of the metric tensor g,, and then using the 
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resulting matrix g^" as an index raising operator. In this fashion, we obtain 
the equation: 


P=} gfe (4.7) 
B1 
For the example of spherical coordinates in E?, 


Ss = 
k 


ie 


x*j, = r sin 0 cos 9f, + r sin 0 sin df, + r cos 04, (4.8) 
1 


ð 
y= B = sin 0 cos $9, + sin 6 sin f, + cos 093, (4.9) 
: 


Q . . 
yo ham r cos 0 cos Q3, + r cos 0 sin df, — r sin 054, (4.10) 


00 
and 
7 os p ] ; E 
737 —r sin 0 sin $$, + r sin 0 cos $$;. (4.11) 
The metric tensor is 
1 0 0 
0 r 0 
0 0 rsin?0 
and the inverse matrix is 
1 0 0 
0 1/r? 0 


0 0  I1/(r?sin? 0) 
Therefore from Eq. (4.7), we have 


F= P= and F° = (1/0? sin? 0) 7p. (412) 
In the general case, a differential tangent vector at a given point can be 
written as 


ds = Y. 7, du’. (4.13) 
a=1 


Given any coordinate system, we can construct an m-dimensional surface 
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by setting the last n — m coordinates equal to some appropriate set of 
constants. (To be perverse, we will refer to the example of the sphere in which 
it is the first coordinate “r” that is set equal to some constant.) The 
points on such an m-dimensional surface are parameterized by the first m 


coordinates. That is 


4 
xl =xl(ul,u7,...,u™ oe" emt)... ce"), 
x? = x7(ut,u?,...,u% ctl em*2,,, LL ch), 
(4.14) 
x^ = x"(ul,u?,... u^, c" 1o cmt. cP). 


When one parameterizes a given surface in this way or any other way, 
one generally encounters difficulties. One cannot usually hope to construct 
a single set of functions which will parameterize an entire surface and be 
well behaved at all points. For example, in the usual parameterization of the 
sphere y, — 0 at the poles where 0 — 0 or x. In general a single set of 
parameterizing functions is useful only in some local area. To treat an entire 
surface, one must usually patch together several parameterizations on 
overlapping areas. In what follows, we discuss only one parameterization on 
a given surface so the results apply only to a local region of the surface. 

A regular point on our m-dimensional surface is a point at which the 
coordinate system of Dirac matrices is linearly independent. At any regular 
point on our m-dimensional surface, there is an m-dimensional plane which 
is tangent to the surface at that point. Furthermore, if the given point is 
considered to be the origin of the plane, then this tangent plane is spanned 
by the first m 7,’s. In this context, we have an m-dimensional space of tangent 
vectors which is indeed tangent to something. 

In spherical coordinates, every point on a sphere is regular except the 
two poles. At any regular point, the tangent plane is spanned by yọ and yọ. 
(See Fig. 4.11.) 

In more general cases, a differential tangent vector in our m-dimensional 
surface may be written in the form: 


ds= Y j, dx”. (4.15) 


ai 


If we designate the lower index Dirac matrices in the new space by y, for 
& —1,2,...,m, then 


Ya = Ya for «= 1,2,...,m. (4.16) 


Furthermore, since gyp? = 3(y,yg + 7574), we can obtain the new metric 
tensor by simply suppressing the indices of g,, which are greater than m. 
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Fig. 4.11. The plane tangent to the sphere is spanned by 7, and yọ- 


That is, 
Jap = Jap fOr a, P = 1,2,...,m. (4.17) 


The computation of the new upper index Dirac matrices is not as simple. 
Although the complete set of n upper index 7"'s spans the same space as the 
complete set of n lower index 7,’s, it is not necessarily true that the first m 
7" span the same subspace as the first m 7,’s. To determine the new set of 
upper index Dirac matrices, one must first compute the inverse matrix of 
the new metric tensor g,,. We can then obtain the new upper index Dirac 
matrices by the relation 


y= y gy, fora=1,2,...,m. (4.18) 
B1 


To illustrate some of these points, let us consider an example of a 
2-dimensional plane in E?. In E?, the position vector s can be written in the 
form 


s = xj,  x^$4 + x. 
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(a) (b) 


Fig. 4.12. (a) The tangent vectors 7;, 72, and 73 form a coordinate system of lower index Dirac 
matrices for the set of coordinates (u!, w?, u?] where x! = u! + u?, x? = u? + u?, and x? = u? 
(b) The upper index Dirac matrices 7!, 7°, and 7? are biorthogonal to the system of lower index 
matrices shown in (a), that is, the scalar product (7,, 7^» = 68. 


Suppose we introduce an alternate set of coordinates {u’, u?, u?), where 
xl =u! +u’, x? =u +u, and x? = ap. 


In this alternate coordinate system, 


s= (u! + 0), + (u? + u°)’ + 0795 
and 


ds — 21 du! T 32 du? t (i + 24 + $4) du?. 


Since 7, = ós/Ou*, the members of the coordinate system of lower index 
Dirac matrices are 


ELE 72 = Po and 73 — 34 +92 t $34. (4.19) 


(This system of lower index vectors is illustrated in Fig. 4.12a.) 
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Using the relation that 9,,1 = 3(7,75 + 757,), we find that 


101 
(Fap) =} 0 1 1 (4.20) 
1 1 3 
Computing the inverse of g,,, we get 
2 1 —1 
(g^) = i 2 —14. (4.21) 
—j] -1 1 
Using Eqs. (4.21) and (4.19), we get 
F = 27 +7- J = fa 
P =F, +27 J = fa Pa (4.22) 


P= Ji h tI = Fs. 
We note that the lower index and upper index Dirac matrices form a 
biorthonormal system of vectors. That is, 


Go ^» = OF. (4.23) 


Equation (4.23) may be checked algebraically using Eqs. (4.19) and 
(4.22). Alternatively, we can obtain a pictorial image of the situation by 
carefully comparing Fig. 4.12a with Fig. 4.12b. We note, for example, that 
3! is perpendicular to the plane spanned by 7, and 73. 

If we now discuss the surface defined by u? = 0, we are in effect talking 
about the xy-plane. At each point on this plane, the tangent space is spanned 
by y, and y;, where 


93-33-15 and V2 = 2 = f2. (4.24) 
Examining Fig. 4.12b, we see that neither 71 nor 7? lie in the tangent 


space at any point on the xy-plane. To compute x! and x?, we first 
recognize the fact that our new metric tensor is simply the 2 x 2 identity 


matrix, that is 
1 0 
(Gap) a P l 2 


Thus the inverse matrix (g^^) is also the 2 x 2 identity matrix. Therefore, 
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from Eq. (4.18), we have 


y —75,—75 and y, = ya = 72. (4.25) 


Another example worth considering is the surface of a sphere. Using the 
usual spherical coordinate system, the metric tensor g,, is diagonal. For this 
situation, we can obtain the elements of the contravariant tensor g^" by 
simply suppressing the same indices that are suppressed to obtain g,, from 
gap- In particular, for the 2-dimensional surface of the sphere embedded in 
E?, we have 


Yo = yo and 7e = Te (4.26) 


The metric tensor for the curved surface written in matrix form is 


r? 0 
ap) = : 4.27 
(Gen) lo r? sin? ij Shen 
The inverse matrix is 
1/r? 0 | 
ay : 4.28 
e) | 0 1/(r? sin? 0) ony) 


So 


y-(/r)y-7 and  3*-(4/(^sin*0)y, = y*. — (429) 


A member of a coordinate system of Dirac matrices, when considered 
as a tangent vector, can have different directions and different lengths at 
different points in space. For example, on the surface of the globe, y, has the 
direction that is labeled “South”. However the direction “South” for a 
resident of Greenland is quite different from the direction “South” for a 
resident of Japan. Furthermore y,, which has the direction “East” at all 
points of the globe, has a magnitude of r sin 0 where 0 is the angle from the 
North Pole as shown in Fig. 4.11. Thus we see that these tangent vectors 
are functions of the points at which they are evaluated. 

When a tangent vector is evaluated on a sequence of points along some 
path, we can consider it to be undergoing a change as it is moved along that 
path. In this context, it makes sense to ask how fast does the tangent vector 
change or what are its derivatives. From Eqs. (4.9), (4.10), and (4.11), we 
find that 


05, : ; i 
a = —r sin @cos $f, — r sin @ sin $$, — r cos 0$, 
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Or 
Ove : 
= —r. 4.30 
38 rj (4.30) 
In a similar fashion, we can compute 
Ov, cos _ 
B i (4.31) 
0Ó sind 
Oy, _ cos _ 
= 4.32 
00 sind’* nd 
Vy _ :22 nz . = 
oó = —rsin* 05, — sin 0 cos O75. (4.33) 


These quantities represent the rates of change as seen by an observer 
who is permitted to take measurements in all three dimensions. However, 
an observer who is constrained by take measurements on the surface of the 
sphere would not observe any changes in the 7, direction. Such an observer 
would detect only the changes in the 7, or jy direction. 

Therefore if we wish to compute the rate of change of y, or y, on the 
surface of the sphere as seen by our 2-dimensional observer then we must 
suppress the y, component. Thus, in place of 0/00 and 0/04, it becomes 
necessary to define two new operators V, and V,. Suppressing the 7, 
component in Eqs. (4.30), (4.31), (4.32), and (4.33), we get 


Voyg = 0 (4.34) 
cos @ 
V, ya = -——— 4.35 
ove am Yo ( ) 
cos 0 
Voya = —— 4.36 
ove sin 0 Yo ( ) 
Vo = —sin 0 cos 07,. (4.37) 


In the general case, the situation is somewhat similar. In the context of 
the entire space dy,/du’ can be expanded as a linear combination of the 7,’s. 
Therefore we may write 


0y,/0u? = Y. TyF (4.38) 
1 


However, in the general case, when we wish to compute the corresponding 
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rates of change observed by an m-dimensional observer in an m-dimensional 
surface, we cannot simply suppress the last n — m indices. The reason for 
this is that if we do not start out with an orthogonal coordinate system, then 
the space spanned by the last n — m y,'s may not be orthogonal to the space 
spanned by the first m y,s. What we wish to do is to project dy,/du’ onto 
the subspace spanned by the first m y,s. To do this we would like to 
decompose 07,/Ou^ into a sum of two tangent vectors, one of which is in the 
subspace spanned by the first m 7,’s and the other is in the orthogonal 
complement. The orthogonal complement is spanned by the last n — m 
contravariant js. Thus when this decomposition is possible, we may write 


0), 
Qu^ 





= VU pvt A Aw) (4.39) 
y=1 


v-m-i 


Such a decomposition is not always possible. (See Problem 4.3.) In a 
pseudo Euclidean space, there are vectors which are orthogonal to them- 
selves. For example, suppose ĵo and ?, are members of an orthonormal basis 
where (fo)? = — (f1)? =I. If v = fo + $,, then <v, v» = 0. Such vectors have 
been variously referred to as isotropic, null, or light-like. 

Given an m-dimensional subspace, the orthogonal complement is a 
subspace of dimension n — m. However, the m-dimensional subspace may 
contain vectors which are not only orthogonal to themselves but to every 
other vector in the subspace. When this is true, these same vectors are also 
contained in the orthogonal complement. In this situation, the m-dimensional 
subspace together with the (n — m)-dimensional orthogonal complement do 
not span the entire space. 

We will avoid opening this Pandora's box by restricting ourselves to the 
case for which the metric tensor for the m-dimensional subspace has a 
non-zero determinant. (See Problem 4.5.) 

When the determinant of (g,,) is zero, the Clifford algebra that 
accompanies the m-dimensional plane is said to be singular. The nature of 
singular Clifford algebras is discussed at length in a set of mimeographed 
notes by Marcel Riesz (1958). 

To compute the I,"s from Eq. (4.39), we can take advantage of the fact 
that the y”’s (not the 7”’s) span the same subspace as the first m j,'s. Thus 


(Oy, /0u*, y"? = x LIO, y»? t by Asp y? 
n=1 n=m+1 
=I," ð; 
and therefore 
Tag” = (oyy/0u*, YY- (4.40) 


Now if we project out that component of 0y5/0u* which is detectable to 
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the m-dimensional observer, we get 
Vaya = ck bow fora, 8-42. (441) 
v-1 


The T,,,"’s are known as Christoffel symbols. Christoffel symbols are not 
tensors because they do not have the required transformation properties 
under changes in the coordinate system. This point will be developed further 
in the next chapter. 

In most presentations the formula used for Christoffel symbols is quite 
different from Eq. (4.40). The usual formula is 


g" (0g, , 0g, a 
Tp = na 4 Ong Rap] 442 
i 2 (us Qu* | du" (4.42) 





To derive Eq. (4.42) from Eq. (4.40), we first observe that 


Oy, — Os — Oy, 
ðu”  Qu*'Ouf Qu. 





(4.43) 


Also y” = g"',. With these thoughts in mind, we have from Eq. (4.40) 


gus s y s / 9t Oy, 
I x g (Qu. n) Tm 39 (s. is) + (ĉi, i.) 
Oy, | 6 Oy, 
(2k, n) E Qu^ Yeo Ya? o. ou 


for arbitrary c and å, we see that 


g"í ô oy [o oy 
I, * == —— |- ? y. 3 + a? zu a? —1 . 
575 (2, Vp» Yn? e But) ap Te)? 7 ess 


Using the relation that <y., y5? = Jap and repeating the use of Eq. (4.43) 
with a different set of indices, it immediately follows that 


g"(0gg, , 0g, oy OY, 
T, (ERN NE 4 + " : ELLA ES 2 
=a (2 dut P gu" Va oyn 


r va g" {Ste OG an = E) 
= 2 N05 Ouf eur) 








Since 











and 
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The fact that the Christoffel symbols can be written in terms of the 
elements of the metric tensor for the m-dimensional surface serves to 
underline the point that V,y, is an entity which is measurable by our 
m-dimensional observer. 

Reviewing our definitions, V,y, represents the orthogonal projection of 
0yg/Ou* or 6?s/du* Ou? onto the m-dimensional plane which is tangent to our 
m-dimensional surface at some given point. Since the same thing can be said 
for V,y,, we can write 


Vag = Vera: (4.44) 


For purposes of future generalization in the next chapter, Eq. (4.44) is 
an important relation. Meanwhile, we note that an immediate consequence 
of Eq. (4.42) or alternatively Eqs. (4.41) and (4.44) is the symmetry of the 
Christoffel symbols in the lower indices, that is 


Dj => D. (4.45) 


The operator V, can be applied to entities other than the y,'s. Suppose 
we consider a Clifford number of the form 


F = Fim "r(ul, oH M rts 
Then we define V, as 


V,F = (VE calla) Yva vp T posu duds vp 


F puso Oo us MaR Yvp +... + ROE ds iii (Va Yv): 
(4.46) 


In Eq. (4.46), it is understood that 


ô 
V pueveul RVR Vp 4.47 
g am (4.47) 


The definition of V,Z that appears in Eq. (4.46) may be regarded as the 
projection of oF /0u" onto the space of Clifford numbers associated with a 
given point on the m-dimensional surface. (See Problem 4.4.) 

The operator V, behaves very much like the operator 0/Ou". However, 
there is one major exception. Unless the m-dimensional surface is intrinsically 
indistinguishable from a flat subspace, V, V, # V,V,. In particular 


(V, V; = Vo Vad) Yy = Rag! Yr (4.48) 


where R*,,, is known as the Riemann curvature tensor. As one might suspect 
from the name, this tensor is a measure of the curvature of the m-dimensional 


CURVED SPACES 79 


surface at any given point on that surface. This point will be developed 
further in the next chapter. 


Problem 4.3. Consider Minkowski 4-space where (7,)? = (72)? = ($,)? = — 
and (fo)? = —I. Consider the 1-dimensional subspace spanned by fo + ,. 
Show that the orthogonal complement is 3-dimensional. Show that the 
vector fọ — $?, cannot be decomposed into a multiple of fọ + ?, and a vector 
in the orthogonal complement. 


Problem 4.4. This last section was devoted to the construction of tools for 
the study of m-dimensional surfaces embedded in an n-dimensional Euclidean 
or pseudo-Euclidean space. At each point on the m-dimensional surface, we 
have an m-dimensional plane or hyperplane spanned by {7,, 72,-.-, Ym} OT 
{y',y7,.-.,7}. When that same point is considered as a member of the 
larger n-dimensional space, we may identify with that point an orthogonal 
complement space of tangent vectors. This is the (n — m)-dimensional space 
spanned by (7"*1,5"*2...., 5". 


on f m\ .. : 
In a similar fashion, we have a ( ) dimensional space of p-vectors 


p 
which can be identified with each point on the m-dimensional surface. 


This is the subspace of p-vectors OU by Clifford numbers of the 
form y, ^ Jy, ^... A Y, OY Ay? ^... A y^ where 1 & v <v €... « 
vp & m. 

(1) Use Eq. (3.38) or (3.39) to show that the orthogonal complement of 


p-vectors mentioned above is spanned by p-vectors which have the 
form 


Yor A Yu Anes A Vy A PETE A PAA LAPP 


where 0< k <p, 1 Svi <v <... <v, Sm, and m 1x v,,« 
Yk42 Z- L Vp SKN. 

Suppose F = F'" "y, A Yy, A... A Yy Compute OF /du* and 
then use Eq. (4.39) and result (1) to decompose 6¥/du* into two 
p-vectors, one of which is in the space spanned by 


Q 


wa 


{Yo A Pug ^^y] Where Iv, «v4 «...«v,&m 


p 


and another which is in the orthogonal complement. 

Use Eqs. (4.41) and (4.46) and result (2) to show V,4 is the 
projection of à 4 /Ou* onto the subspace of p-vectors spanned by 
Clifford numbers of the form y,, ^ Yy, A... ^ Yy, where 1 & v, < 
V2 €... € vy & m. 


G 


— 


Problem 4.5. Show that if the determinant of (g,,) is nonzero, then there is 
no vector (other than 0) in the space spanned by the m y,s which is 
orthogonal to that space. Hint: suppose there exists such a vector v. Then 
y = A*y, and 0 = (v, Ya) = A'Qy,, yg? = A" gag. 
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Problem 4.6. Use Eqs. (4.34), (4.35), (4.36), (4.37), and (4.48) to show 
that for the surface of the sphere Rosap = —r? sin? 0, and Ryg’? = —1/r?. 


Problem 4.7. Use Eq. (4.48) and Eq. (4.41) to show that 


6 ô : 
R y m. i — —; Le TT! - 5 Ve 

af ou B Qu? B à BA 
Problem 4.8. Since V,y? is a tangent vector, it can be expanded as a linear 
combination of y"'s, that is, V,7’ = C,,^y". Show that C,’ = —T,,*. Hint: 
use the relation that 68 = (y,, y^» and apply the operator V, to both sides 
of that equation. 


43 Parallel Transport on an m-Dimensional Surface Embedded in an 
n-Dimensional Flat Space 


Having defined the operator V,, it now becomes possible to define the notion 
of parallel transport of a vector along a curve in an m-dimensional surface 
embedded in an n-dimensional flat space. 

A point on such a curve may be represented by the position vector 


n 


x(s) — T x'(ul(s), u?(s), .. . , u"(s))f,. (4.49) 


i-1 


To compute the vector tangent to the curve in the direction of increasing s, 
we merely compute the derivative: 
dx = & Dx dut du’ 
— = Yı = 
ds g=1.21 ĝu” ds ds 





Ya (4.50) 








where 





An observer in the large n-dimensional space who wished to compute 
the derivative of a Clifford number ¥ with respect to s would simply use 
the formula 


dz ôF du” 


= . 4.51 
ds Qu* ds ( ) 











However, an observer constrained to take all measurements on the 
m-dimensional surface would detect only components of dF /ds projected 
onto the space of Clifford numbers identified with the tangential plane. Thus 
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such an observer would write 
V.F = V,F (du%/ds). 
With this definition of V,, we have 
V,yg = (Va yg Xdu*/ds) = T,,”y,(du*/ds). (4.52) 


An index free Clifford number F which is parallel transported along a 
curve x(s) (not necessarily a geodesic) is one for which 


VF =0. (4.53) 


We can now give a more sophisticated definition of a geodesic than that 
given in Section 4.1. A geodesic is a curve x(s) which can be parameterized 
in terms of a variable s in such a way that the tangent vector dx/ds is parallel 
transported along the curve; that is 


V, (=) = Vix(s)- 0. (4.54) 
ds 


It should be emphasized that the parameterization of a geodesic may be 
such that Eq. (4.54) does not hold. This is most easily visualized if s is thought 
of as a time variable. Then x(s) represents the position of a point which 
moves along our surface as time advances. The tangent vector dx/ds then 
represents the velocity vector. Of course we can adjust the parameterization 
so that the point moves with a changing speed. Clearly if the magnitude of 
the velocity is changing then the acceleration V2x(s) measured even by an 
observer constrained to the surface is not zero. A geodesic on a surface may 
be considered to be a path such that a point moving along it with constant 
speed will experience no acceleration or change in direction which is 
detectable to an observer who is constrained to take all measurements on 
that surface. 

To obtain the equations for the coordinates of a geodesic, one can use 
Eqs. (4.50) and (4.52) to get 


B 
V2x(s) = (n =) 





ds 
du* du^ d?u? 
- (V, $ 
(Vag) ds ds " ds? 





» (r ,dw ete) 
TE ds ds ds? jJ“ 
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Therefore the equations for the coordinates of a geodesic may be written as 


d^u* ro de duf — 


aq UE 4.55 
ds? ^ ds ds (629) 





Of course if the u*’s are Euclidean coordinates in a Euclidean plane, 
then the Christoffel symbols are zero and the equations ar easily solved. One 
should not be too surprised to observe that in this situation the general 
solution for a geodesic is an arbitrary straight line in the m-dimensional 
Euclidean plane; that is 


u* = c*s +u% fora=1,2,...,m, 
where the c*'s and u$'s are arbitrary constants. 


Problem 4.9. From Eqs. (4.34)-(4.37), one can read off the values for the 
Christoffel symbols for the surface of a sphere. Use these values and Eq. 
(4.53) to treat the problem of parallel transporting a vector around a 
four-sided figure on a sphere formed by two parallels and two meridians. 
That is, first parallel transport the vector A*y; + A*y,, “south” along the 
path 0 =s + 0s, à = o from s — 0 to s= 0, —0,. Then parallel trans- 
port the vector “east” along the path 0 = 0,, 6=s+ o from s — O to 
s = $, — $o. Then parallel transport the vector "north" along the path 
0 = —s + 0, $ = $, from s=0 to s = 0, — Oo. Finally parallel transport 
the vector back to its original position by moving it “west” along the path 
0 = 0, and ġ = —s + $, from s = 0 to s = $, — do. 

Several checkpoints for the computation are as follows. For all path 
segments, one should get 


e 6 
Cnt A om AP Sn beeen” zo 


00 ds | 0$ ds ds 





and 





$ 0d 
0A GO ge AN ais dó _ 
00 ds sin 0 ds sin 0 ds 


Integrating these equations along the first leg of the rectangular loop, one 
should get 


=A and A®sin@ = A$ sin 0 


where (A$, A$) are the components of the vector in the northwest corner of 
the loop. 
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Integrating along the second segment, one should get 


A? = A? cos[(ó — Po) cos 0,] + (4$ sin 0,) sin[(ó — $4) cos 6,] 
and 


A? sin 0, = (A4? sin 0,) cos[(d — $o) cos 0,] — 48 sin[($ — $4) cos 06,] 


where (A5, A?) are the components of the vector in the southwest corner of 
the loop. Note: the parallel transport along the curve of constant latitude 
results in a clockwise rotation with respect to the curve of constant latitude. 
This is the same direction of rotation that would result if you approximated 
the parallel by a sequence of geodesics. 

When you have completed the parallel transport of the vector about the 
rectangular loop, you should be able to show that the resulting angle of 
rotation is ($, — $9)(cos 0, — cos 0,). It is an easy matter to compute the 
area of the rectangular loop so that one can see that the result for this 
problem is consistent with Eq. (4.3). 


5 


THE USE OF FOCK-IVANENKO 2-VECTORS 
TO OBTAIN THE SCHWARZSCHILD METRIC 


5.1 The Operator V, and Dirac Matrices in Curved Spaces 


In Chapter 10, it will be shown how to construct a system of ortho- 
normal Dirac matrices for an arbitrary signature matrix. That is, given any 
signature matrix nj, it is possible to construct a set of Dirac matrices such 
that 


"n T iM = 2nj I. (5.1) 


Using this result, it is not too difficult to construct a set of coordinate Dirac 
matrices for a more general metric tensor. 

Suppose we denote the matrix corresponding to the matrix tensor g,, 
by G. That is 


Jit S12 -+ Jin 


G= in J22 +> xs 
gui n2 RIO Inn 


An arbitrary metric tensor is real and symmetric, that is g,5 = gg,. From 
matrix theory it is well known that a real symmetric matrix can be 
diagonalized by a real orthogonal matrix. Thus we have 


G = O0! DO (5.2) 


where O is an orthogonal matrix; OT is the transpose of O which is identical 
to O~1; and D is a diagonal matrix. Furthermore, the diagonal elements of 
D are real and may be ordered so that all the positive components are 
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identified with the first r indices. That is 
(^ 
(a)? 


D= (a)? 


(Gre oe 


= (rt) 


where r + s = n. Clearly D can be decomposed further into a product of the 
form 


D = AnA (5.3) 


where n is the signature matrix and 


a 0 ... 0 
"e 0 a, 0 
0 Ve TOR 


The signature matrix n is of course uniquely determined by the metric 
tensor. 
From Eqs. (5.2) and (5.3), we have 


G = OTATnAO = W'nW where W = AO. 
Thus 
Jag = wi Wing. (5.4) 


If we now define y, = W77,; then 


Vale + Veta = Wi WiDr t5) 
= 2WiWin,l 
Sigal: (5.5) 


To obtain the contravariant Dirac matrices, we merely write 


y* = g”). (5.6) 
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In Chapter 4, we discussed m-dimensional surfaces embedded in an 
n-dimensional Euclidean or pseudo-Euclidean space. In that context V,y; 
was defined to be the projection of 0y5/Ou* onto the m-dimensional tangent 
space spanned by y,, y2,..., Ym at a given point on the m-dimensional surface 
under study. In that situation, it was noted that 


VaVe = Ve Va: (5.7) 


In a more general context, an m-dimensional curved space may or may not 
be embedded in a higher dimensional flat space. In this more general context, 
a differential operator that satisfies Eq. (5.7) is said to be torsion-free. The 
operator V, is uniquely defined by Eq. (5.7) and four other conditions which 
are even more compelling. The five conditions are: 


(1) V, acting on a scalar or the tensor component of any p-vector 
coincides with the derivative 0/0u*; 

(2) V,yg = Veva (V, is torsion-free); 

(2) V, yg is a linear combination of the y,’s, that is V,yg = I,5*y,; 

(4) if of and B are any Clifford numbers with differentiable components, 
then 


V(t + B) = Vt + VB; 
(5) V, satisfies the Leibniz property, that is 
VALAB) = (V,A)B+ A(V,B). 


To see that these five conditions do indeed define V,, we need only see 
how these five conditions determine a formula for the Christoffel symbols. 
Since 29,51 = Yap + ygy,, we have 


V.2g,51) = Vale + Ve¥a)- (5.8) 


Using condition (1) on the left-hand side of Eq. (5.8) and conditions (4) and 
(5) on the right-hand side, we get 


óg, 
2 P I = (V, Ya) Yg YR) + VaYs Yp) + (Wp) Ya: 


Now applying condition (3) on the right-hand side, one has 


0g. 
2 a I= Ius zb Yn Ys) + D" yy, + YaYa)> 
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or 


0g, 
F = Ta gg + D, Jar (5.9) 
By cyclic permutation of the unsummed indices, we obtain two other 
equations: 





0 
Sat = Tuas Tap (5.10) 
and 
09 va ; à 
2 Qu! = —Ts, I na ES I, Gyn: (5.11) 


From the torsion-free condition, I," = D,,". Thus if we add Eqs. (5.9), 
(5.10), and (5.11), we get 


0g, OG gy Og, 
Gap , Ie g 


21," = ; 
va Ing Qu' Qut = du? 





Multiplying both sides of this equation by g^^/2 and noting that g,,9^^ = 64, 
one immediately obtains the equation 


AB à 
n; -2 (2e LIR on (5.12) 





2 \ ôu” ðu” dub 


The operator V, may be applied to any Clifford number with differen- 
tiable components including contravariant Dirac matrices. Since the contra- 
variant Dirac matrix y’ is a linear combination of covariant y,’s, it is clear 
that V,y* is a linear combination of y,’s and is therefore also a linear 
combination of y”s. Thus V^ can be written in the form A,,^". It is not 
difficult to show that A,,’ = —I,,/. To prove this identity, we observe that 

0 = V,(2641) 
= Vy, + Yay?) 
= (Vy yy,  yv( VP) Y, Is) + (Oy)? 
= Ay n Hat) + Lo, qo 
= 2A, (0,1) + 2T, (5I). 
Therefore 0 = A,,? + T,,° and 
Vy, = — VP y". (5.13) 


It is important to observe that the Christoffel symbols do not transform 
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as tensors. From Eq. (5.8), we have 
(294 I) VO yg I ) Ya): 
7 y B B 


It was shown that this equation essentially defines V,. Therefore under a 
change of coordinates, V, behaves in the same manner as 0/Ou*. That is 




















— ĝu" 
Va = aaa Var 5.14 
oat" (5.14) 
Thus 
Ü du" au? 
V5? =| — V, Hy 
Gs Xe s) 
and 
—T,,°7* = _ du" out vu ou" ^u? 
d ou" Qu' ™ a Gu" ou” 
It then follows that 
Lr Boa = Qu" Ou" Qu^ aa oa? -A 
"coo oa ôu ont " ^ — Qu* Q^ Gu" ôu” 
and finally 
p» _ Ou" du! Ou", du" dur Ouf (5.15) 


^  Qu*ow'Ou^ " — Qü*0ü^Qu' Qu" 


Equation (5.15) can also be proven with more grief directly from Eq. (5.12). 

Although Christoffel symbols do not transform as tensors, we see from 
Eq. (5.14) that V, does transform as a tensor. It is understood here that 
of is a Clifford number that is index free, that is ef has no unsummed indices. 
For example, suppose .e/ = Ag, y^ YY, Then 


Vos. = (0A5,"/0u*) y) y" v, — Ag," (T, VY Yn 
= Ap, 9 (D, 9^), * Ag, )? Y (Ta^). (5.16) 


Introducing the comma notation, we can write 


0A," 
— m Ay," a 5.17 
Qu* pes ( ) 


FOCK-IVANENKO 2-VECTORS 89 
Relabeling some of the dummy indices in Eq. (5.16), we now have 
Va = (Agy" a — A,y"T yg" = Ap Ta + Ap, Tua") y? Yy (5.18) 
This motivates the introduction of the semicolon notation. In our example 
Apa = Agy a — An Tag — Agi yy? + Ag, uj. (5.19) 
With this notation Eq. (5.18) becomes 
Val = Apra Y 1). (5.20) 


Both of the entities V, and yyy transform as tensors and therefore 
Ag,",, also transforms as a tensor. That is 


— , Of du’ du? ðu” 
ip = way A AL ; 
"P ^ oi" pū? du" gge P 





(5.21) 


For many years A,,",, was said to be the covariant derivative of the tensor 
Ap". In recent years some people have gone to great lengths to formulate 
matters in a way which would avoid reference to anything which could be 
interpreted as a specific coordinate system. To pursue this end, a slightly 
modified terminology and notation is used. In Spinors and Space-Time by 
Penrose and Rindler (1984) and General Relativity by Wald (1984), an 
“abstract index” is used. In our example the Clifford number Z would be 
labeled 4,7. The indices b, c, and d merely indicate the type of tensor 
components in the index free Clifford number and do not refer to any 
coordinate system. In the same spirit, y*V, is labeled by V, and is referred 
to as the “covariant derivative". In this context A,,’,, is said to be a 
“component” of the “tensor” V, A, A. 

For our purposes, it will be more useful to use the traditional ter- 
minology. To extend the notion of covariant derivative to coordinate Dirac 
matrices, we require that the covariant derivative have the Leibniz property 
and we also require that if æ is any index free Clifford number then 


V,A = sf... (5.22) 
This means that if Z = A^y,, then V,(A*y,) = (Af Yg), a or 
AP. ag + APT, Gy, = AP yg + AP Ypa (5.23) 
Relabeling some dummy indices on the left-hand side of Eq. (5.23), we have 


(Af, + AT on) Yg = A? yp + AP pa 
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Since Af, „ = A? , + A"T,,? and since the A^'s are arbitrary it follows that 
Yg;a = 0. (5.24) 
In a similar fashion, it can be shown that 
yg =O. (5.25) 
Problem 5.1. Show that A* , is not a tensor. 
Problem 5.2. Use Eq. (5.24) to show that g,,,, = 0. 


Problem 5.3. Suppose A*!*? transforms as a tensor. Show that 
Pp BiB2...Bs 

the contracted tensor A**? 9-14, ,, g, ,4 also transforms as a tensor but 

as one of lower order. 


Problem 5.4. Show that 


QA 
A, tof = C yl 
sp? Qu d 


Problem 5.5. Show that the result of Problem 5.4 can be generalized to 
higher order p-vectors. That is 
A 


yere: an — A arn +n 


AIAZ... An B A142. Ans 


5.2 Connection Coefficients and Fock-Ivanenko 2-Vectors 


So far, we have focused mostly on coordinate systems of Dirac matrices. 
However, for many computations, it is useful to use noncoordinate systems. 
For systems which are possibly noncoordinate, we will use Latin indices and 
we will continue to reserve Greek indices for systems which are specifically 
coordinate. For a possibly noncoordinate system, one has 


Vy, = WVW E Yp) = WSWE,ty, + WS WE vyg. (5.26) 
We define I," by the relation 


Vjyy = Vs" yn. (5.27) 
These generalized Christoffel symbols are called connection coefficients. For 
a coordinate system that is torsion-free, I,," = Ip”. However, for a non- 
coordinate system, it is generally not true that I," = I". Therefore, it is 
important to pay attention to the order of the lower indices in the connection 
coefficients. Traditionally those using the comma and semicolon notation 
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have used the order opposite to that used in Eq. (5.27). However, those using 

the V, notation find the sequence adopted in this text to be more natural. For 

this reason, I have followed the convention of Penrose and Rindler (1984). 
From Eqs. (5.26) and (5.27), we have 

Dy" y, = WI WEW Tap" y, + WWE Wm ym. 


J 


and thus 





0s 3 09 va OGap 


Ty" = 3W: WE Wg” 
jk ;W (g (s Ou B ou’ 


2) +W Wọ Wi, (5.28) 
We now observe that 
wig” = Wwrwi wig’) = oF Wig" = Wig™. 


Thus Eq. (5.28) becomes 





r,"-iwiwt wa (eeu Pa + oe _ see + WIWEWE, (529) 
u 
or 
D," = AW: WEW:g"*(A) + WW? WE, (5.30) 
where 
ð 
A= RU Irs) F a5 07 Wg,) — ap Ws). 


To reorganize this formula into a standard form, it is necessary to introduce 
the notion of commutator coefficients c". These coefficients are defined by 
the relation 


(0,0, — 0,0) = cs" Om (5.31) 


where 





40 
ô= Wi out 


From this definition, a small amount of scratch work will show that 
= WrI(0Wi-—0,W!). (5.32) 
After a substantial amount of manipulation Eq. (5.30) becomes 


n"- 39" (05g, + Og, — ag) + 39" (Cox + Cika — Cigk) (533) 
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where 

Cikp = ÜpnCjk - (5.34) 
We now define I", by the equation 

Dep = Ipm x” (5.35) 


It should be noted that this is not standard notation. Usually when the upper 
index of a Christoffel symbol or connection symbol is lowered, it is lowered 
into the first position. That is Tkp = g;,L*,,. I have chosen Eq. (5.35) to 
define Fp because it is more compatible with the V; notation. With this 
convention, it follows from Eq. (5.33) that 


Is s 10,91, + OG sp = 6G jx) + Xy + Cikp — Cjpk)- (5.36) 


Suppose an orthonormal system of Dirac matrices is chosen. In that 
circumstance, the Dirac matrices can be regarded as constant with respect 
to ordinary differentiation, that is 


ô 
Qu* 





Ve = 0 : 
Then 


A 0 ^ 
0), = Wiss" - 0. 


In this situation, the first three terms in Eq. (5.36) are zero. In that case 
Vg = Xy; + Cirp — Cjp)- (5.37) 


In this context, the connection coefficients are known as Ricci rotation 
coefficients. From Eqs. (5.32), (5.34), and (5.37), it follows that 


Dap = -T 


j jpk* 


(5.38) 


This symmetry can result in substantial savings in computation. In an 
n-dimensional coordinate system where I," = Fg,” there are as many as 


n . : ; 

"ns distinct nonzero Christoffel symbols for which « # f and n? more 
symbols for which « = f. On the other hand for a constant frame there 
are only ; n Ricci rotation coefficients to compute. For the usual 4- 


dimensional space-time manifold of general relativity, this means computing 
24 entities instead of 40. Furthermore, if the coordinate matrix tensor gag is 
diagonal, even further savings occur. In that case the constant frame can be 
chosen so that W^? = 0 unless a = a. From Eqs. (5.32) and (5.34), it is clear 
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that for this case cj, = 0 unless p = k or p =j. From Eq. (5.37), Ty, = 0 
unless at least two of the indices are identical. Because of the antisymmetry 
of the last two indices T, = 0 if k = p. Thus the only non-zero connection 


jkp 
coefficients in this case are of the form I, or Tp. Since T; = —I75,; and 


JJ. JPJ' 
p £j, there are only n(n — 1) nonzero coefficients to compuée. For the 


4-dimensional space-time manifold of general relativity, this means a maxi- 
mum of 12 non-zero connection coefficients when the coordinate metric 
tensor is diagonal. 

To take full advantage of these symmetries, it is useful to apply the 
notion of Fock-Ivanenko 2-vectors (Fock and Ivanenko 1929; Fock 1929). 
A Fock-Ivanenko 2-vector T, may be defined by the equation 


L, = iW. (5.39) 


What makes these 2-forms useful is the relation that if f, is a member of a 
constant system of Dirac matrices for which 5, , = 0, then 


Vk == atk + 2M. (5.40) 
To verify Eq. (5.40), it is useful to note that 


y? ytyy = VPO Ve H VVI) — Oy + yy)? + VEY? Y* 
or 


P” Yk — yy" = 2y? Of — 27° Of. (5.41) 


Thus 


-Tafe AT, = EWT p — 199 
—MWAT, (^81 — $5) 
= WC TP + Taufe) 
= WI, ikq ^ 

= Wil, 

= WV), 


= Vis. 


As will be shown below, Eq. (5.40) can be generalized in a nice way to any 
Clifford number. Meanwhile, we wish to establish a simple formula for I. 
From Eq. (5.29), one sees that 


ow? 


0g,s i OG on OG up ; i. 
u 


I. 
Qu^ Qu? Qu 


ijp 








-amwyn )+ nmp Wt Wp 
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It then follows that 
T, = iW, ?^ 


ôg ôg 
= ODVE wr De 4 Sre 3 
at X j (2 Qu au” 


Pe (ee ee 
F 16D Pn e (W5 W?) F^ (3 ĝu” r) "|. 








Therefore 


r=} 


a 





Qu* du Qu 
Since yf” = —5". it follows that 7^'(0g,,/0u*) = 0. Thus we have 


0g og 0 
rT olafv Y va aß 148 ^ 
a = 8) (2 ga) ay ow Ye 


»f ĉde _ Iva — 99. i 
( , d 402 WI ^; Wigs). 








and finally 


Ge 1, 9 
i 5.42 
PT iy Nar (5.42) 


T, = 4” 

In the context of general relativity, the orthonormal frame is known as 
a Vierbein. The definition of 0/0u* and therefore T, must depend on the 
choice of the Dirac system of orthonormal matrices or Vierbein. The first 
term on the right hand side of Eq. (5.42) is independent of the choice of 
orthonormal frame. Sometimes it is possible to choose an orthonormal frame 
that will make the second term zero. This is possible in the case of the 
Schwarzschild and Kerr metrics. 

If the metric tensor for the coordinate system is diagonal, then one can 
simplify this formula even more. In particular the first term is simplified and 
the second term becomes zero if one chooses the constant orthonormal frame 
to be parallel to the coordinate frame. In particular, we can let 


Y= )ulgagl* = gp?" 


where f = b = 1,2,..., or n and no sum is intended. Then 











ES ad | 
eit = fa*lggg! * eE 
ðu” 
ens ey Ipe v 
— 27 
lg] Qu* 
0g 
= zy = 


dur 
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It then follows that 


ð Ó 
ys = 30" ^ Y) 7 


By 
^ ĝu” ĉu 





955 — 0, 
x 


For the first term in Eq. (5.42), we merely note that in the summation over 
the v index g,, = 0 unless v = «. Thus if g,, is diagonal, we may write 


0 


|» = 2 Qu Jaa 


(5.43) 


where the fj is summed but the g index is not. 
To generalize Eq. (5.40), we first note that 
V,(j*) = V,(n$,) = n° V, 
= n*( 1,5; + 90.) 
and thus 
V^) = -rnj* s ?T,. 


For a product of orthonormal Dirac matrices, we get some nifty cancella- 
tions. For example 


and thus 


^ 


VA o) = —1,0*5,3,) + OSIE 


For a Clifford number of the form A,?2*,?,, we would have 





XA OA, aka a mor eae 
VA $F) = F 9/9, — Li Ay $959.) + (Au $*$;$,)T,. 


From these illustrative examples, one can see that 


ô 
ðu” 





VE J=<4l I1-EL J+ JE (5.44) 


where any differentiable Clifford number can be inserted in the square 
brackets. 
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If one wants to use some directional derivatives associated with some 
noncoordinate frame, one can define 


T, = W2Y,. (5.45) 
Multiplying Eq. (5.44) by W% and summing over the « index then gives us 


VL 1-294, 1-LEL J+ 1E. (5.46) 


A comment should be made here. All equations and remarks that were 
made for Fock-Ivanenko 2-vectors using an orthonormal system of Dirac 
matrices for which 7, , = 0, remain valid for any constant frame. This point 
was not made in the above text because of the limitations of the notation 
that I have selected for this book. 

In closing this section, I would like to express the opinion that use of 
Fock-Ivanenko 2-vectors usually provides the most efficient method of 
computing the Riemann curvature tensor. This point will be pursued in 
Section 5.4. 


Problem 5.6. For the 2-dimensional surface of a sphere, gay — r?, Joo = 
r? sin 0. Use Eq. (5.43) to compute T, and T}. 


Problem 5.7. Use Eq. (5.36) to show that 


Chik = D m T ak (5.47) 


Problem 5.8. Define [0,, 0,] = 0; 0, — 6; 6,. 
(1) Show 
[Lé,, 6,], 0] + EE0,. ôk], 0,] + LEA, 0,], 0,] = 0. (5.48) 
(2) Use Eqs. (5.31) and (5.48) to show that 


CP cy + cu c" + Chi” Cy,” = 0. (5.49) 


ij 


(Equations (5.48) and (5.49) are known as Jacobi identities.) 


5.3 The Riemann Curvature Tensor and its Symmetries 


For a coordinate system of Dirac matrices, the Riemann curvature tensor 
R,5," may be defined by the relation 


(VV; = VeVi) Yy uni Raps ys (5.50) 
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(This is the convention used in Spinors and Space-Time, by Roger Penrose 
and Wolfgang Rindler (1984, p. 200). However, it is not standard for the 
simple reason that there is no standard notation. Virtually every conceivable 
variation has been and is used.) 

Because of the V,'s that appear on the left-hand side of Eq. (5.50), it is 
surprising that R,,," is a tensor. However, this entity does indeed behave as 
a tensor under a change of coordinates. (See Problem 5.9.) On the other 
hand, Eq. (5.50) must be adjusted for noncoordinate systems. If we require 
that 


Ry " Wi ws Wi We Rapy" (5.51) 
we then find that R,” must satisfy the equation 
(VV, — ViVi) Ye — €," Vin Ve = Rijk” Ym (5.52) 


where the c;;""s are the commutator coefficients defined by Eqs. (5.31) and 
(5.32). 

It is useful to state and then prove some symmetries for the indices of 
Rijem- First, R, xm is antisymmetric with respect to its first two indices. That is 


Res = = Rika (5.53) 


The tensor is also antisymmetric with respect to its last two indices. That is 
Rig, = — Rome (5.54) 
Furthermore, there is a cyclic symmetry in the first three indices. In particular 
Ry jem + Ryim + Riam = 9- (5.55) 


(This is generalized in Prob. 5.12.) Finally, the curvature tensor is symmetric 
with respect to an exchange of the first pair of indices with the second pair: 


Ry jem = Rims- (5.56) 


We will now verify these symmetries for coordinate systems and then 
note that the same symmetries immediately follow for noncoordinate frames 
since 


Ris, = W2W! WEW? Ragin: (5.57) 


ijkm 
To verify that R,,,, = — R5,,,, we merely reexamine Eq. (5.50) which 
was used to define the tensor. 


To show that R —R is more difficult. To obtain this result, we 


afvq T afiuv 
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make use of the fact that V, acts on g,, like a partial derivative. Thus 
(VaVe — VeVi) Gund = 9. (5.58) 
This means that 
(V, V; — VV vy, + yy.) = 0. 
Applying V; and then V, to the product y,y,, we find that 
V, Vg Yv Yn = (Va Vg Yv) Yn + (Ver (Varn) 
+ (Va Yv (Vg Yn) + Y(VaVe%n)- 


A similar equation can be obtained by reversing the « and f indices. Taking 
the difference of the two equations then gives us 
(V, V; "ow Vs Va) Yv Yn = ((V, V; Ea Ve Va) Y») 7n 
F ?(V, Vg PX Vp Vu) Vn) 
= Rage YoYn T Rapa VvYa (5.59) 


Now we can reverse the v and 5 indices in Eq. (5.59), and then add the 
resulting equation to Eq. (5.59). One then has 


(V, V, NE V; V,)2gy,1) = Rags gs) + Rag, (29 vel). 
Carrying out the summations over the o index and using Eq. (5.58), one has 


0 = 2Ry py, + 2R 


apyyn apny? 


and this is equivalent to what we set out to prove. 
To obtain the cyclic symmetry, we make use of the torsion-free condition. 
That is Vy, = V,y,. This implies that 
Va Vg?» n VV. 96 TÓ 0, 
VV y. us Vg Vos =| 0, 
and 
V,V.yg Pu V, Vy. = 0. 


Adding these three equations and then regrouping the terms together gives 
us the equation: 


(V, V; Tx Ve Va) Yy + (V; V, Fn V, Vg) Ya + (V, Va = VV.) ¥p = 0, 
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Or 
(Rag, + Roya’ F Ryag Yo =0. (5.60) 
Multiplying both sides of Eq. (5.60) by y, from the right gives us a new 


equation. Multiplying Eq. (5.60) by y, from the left gives us a second new 
equation. Adding the resulting equations gives us 


(Rags? ap Reva’ + Ryg (2G nol ) = 0 
or 


Ragvn + R bvan F R yapn =0. 


To get the last symmetry, we add up four versions of the cyclic symmetry 
equation. That is 


Rapyn + Revan + Rvagy = 9 
R ganv + Rangv + Raga, = 9 
—Rvnap — Ry — Rag = 0 
—Rvpa — Ragga — Ronva = O. 


When we add these last four equations together, we make use of the fact 
that Rim = Rjj,,. The result of the addition is then 


Ragen — ZR yap = 0 


af vn 


which is what we set out to prove. 
A tensor which is closely associated with the Riemann tensor is the Ricci 
tensor R,, which is defined by the equation 
Rij = Ri". (5.61) 


It is not difficult to show that the Ricci tensor is symmetric with respect 
to its two indices. We merely note that 


Ry = Rip? = Rig = R jig? = Rigi = Riv (5.62) 


U J Jt 


The Ricci tensor plays a fundamental role in Einstein's theory of general 
relativity. This point will be discussed in Chapter 6. 


Problem 5.9. Note that 


= = Qu? Qu? Qu? Qu? Qu^ 
V,V, — VV), V, Voss ve V, 
eiT [e je ) e X NF 7a) 
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(1) Use this relation to show that 


Ou? du? Qu" 


VV, — VV), = — — 
(GV, = VV. = ncs oi aa 





(V; V, y V, V3)y,- 


(2) Use the result from part (1) to show that R,,," transforms as a tensor 
under a change of coordinates. 


Problem 5.10. 
(1) Show that 


WiWÜWgV,Vs — Va Va) Yy (VV, — ViVi Ye — €" Vn Pe 


where the c;,"s are the commutator coefficients defined by Eqs. 
(5.31) and (5.32). Suggestion: first show that (V, V; — V, V AOW kY) = 
Wi(V,Vs n V; V» 
(2) Using the result from part (1), show that if 
Ry" = WIWE WW Reg, 
then 
Rijk” Ym = (VIV, — VIV) — Cir" Vai 


Problem 5.11. Use the relation that Rygy, = R,,,, and Eq. (5.57) to show 
that Rok = Remy: 


Problem 5.12. In this section, it was shown that the Riemann tensor has a 
cyclic symmetry in the first three indices. Show that the Riemann tensor has 


the same cyclic symmetry in any of its three indices. For example show 
Ri em + Ritm; + R myk = 0. 


Problem 5.13. 
(1) Use the Leibniz property of V, to show 


(V, V; BN Vp Va) Ya?” " L(V, V, m V, Va) Ya ]?” + Ya(Va Vg ds Ve Va)”. 
(2) Using the fact that (V, V, — V,V,)ó; I = 0, demonstrate that 
(V, V; — VeV) Y” = — Rag, Y 


(3) Show 
(V, Vy — V V,) A" yy ys Y" 
= (A Ra! + AY Rapa’ — ATRas Vy Vo? 
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Problem 5.14. Use the contravariant version of Eq. (3.31) to show that 
Risen Y Y = = 2R uid is 


Problem 5.15. Use the contravariant version of Eq. (3.32) to show that 
Rs) 77 y" = —2RI where R = Rj = Rag. 


Problem 5.16. Use Eq. (5.52) and the relation V;y; = I;j"y, to show that 


Rye ol esL Tyee. 883) 


JP 


Problem 5.17. Use Eqs. (5.47) and (5.63) to show that 


Ran = OT jem md Oi Vis, + DTT: 


ipm 


— TP Vom — LPT pem + UU em (5-64) 


JP 


Problem 5.18. Show that 


Ani — Ane = Rgp AT. (5.65) 


5.4 The Use of Fock-Ivanenko 2-Vectors to Compute Curvature 2-Forms 


In Section 5.2, we were able to exploit the antisymmetry of the last two 
indices of the Ricci rotation coefficients. In particular that antisymmetry 
enabled us to express all differentiation relations for Clifford numbers in 
terms of Fock-Ivanenko 2-vectors. It is possible to exploit the antisymmetry 
of the last two indices of the totally covariant version of the Riemann 
curvature tensor in a similar way. 

With that intent, we introduce the notion of a curvature 2-form. The 
symbol &;, will be used to designate a curvature 2-form where 


Rij = aR seals (5.66) 
It is a slight abuse of terminology to refer to 22;; as a 2-form rather than 
a 2-vector. However, it is the Clifford algebra analogue of the 2-form that 


appears in the theory of differential forms. 
For Fock—Ivanenko 2-vectors, it was shown that 


Ij, = Yi)" = T; fk + AE; 
and thus 
Vii, = “Dit + $T 


In a similar fashion, it can be shown that 


Rik Ym = Ry jm Y= -iR Yk + WGR) (5.67) 
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and thus 
(V,V, T V,V,) Yk = Cy Vin Vk = — 180, y, E NGR). (5.68) 


It is left to the reader to show that to verify Eq. (5.67), one merely uses Eq. 
(5.41) in much the same way as it was used to verify Eq. (5.40) for 
Fock-Ivanenko 2-vectors. 

Equation (5.68) may be generalized in essentially the same way that Eq. 
(5.40) was generalized for Fock-Ivanenko 2-vectors to obtain Eq. (5.43). The 
only significant difference is that a tensor coefficient of a p-vector commutes 
with the curvature operator. For example 


(V,V, Pa V,V, mm C," Vs) Apa Y" Yes = Apa (VV, P VV, — Cif Vs) Y Ve Vt (5.69) 
In this fashion the generalized version of Eq. (5.68) becomes 
(VV,- VV -e"VE 1-9 -$2,[ 1+0 1G8,) — (6.70) 

where any twice differentiable Clifford number can be inserted in the square 
brackets. 

Perhaps one should note that if a tensor multiple of J were added to 
Rij Eq. (5.70) would remain valid. On the other hand, if we require @;; to 
be a 2-vector, it is unique. 


It is now not too difficult to obtain a simple formula for curvature 
2-forms in terms of Fock-Ivanenko 2-vectors. Using the fact that 


Vyyy = —T,yy, + Yu. 


one can show that 


(V,V, V;V, zn Ci" Vin) yy = —(V,T; = VT; + TT, g TT, ~ CT) Ye 
Tou, — VP, + 0T,- TT; — c," Dy): 


Comparing this result with Eq. (5.70), one would guess that 
R, = VT, — VT; + ET; — ET; — eTa. (5.71) 
The only thing left to verify is that the right-hand side of Eq. (5.71) is a 


2-vector. This is left to the reader. (See Problem 5.19.) 
An alternate form of Eq. (5.71) may be obtained by noting that 





wn-wil 
ĝu” 


= ôF, — T;T, 4L. 


T, - CT; + CT, 
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we then get 

i90, = 9T; x A coo LT; + rr, = c; Tm (5.72) 
(Comparing Eq. (5.71) and (5.72), it is important to observe the difference 
in signs that occurs in front of the term (LF, — F,F;).) 

Equations (5.71) and (5.72) are probably most useful for coordinate 
systems for which the commutator coefficients are zero. In the case of the 
4-dimensional space-time manifold of general relativity one has only four 
Fock-Ivanenko 2-vectors to deal with in place of forty Christoffel symbols. 
This fact will be exploited in Chapter 6 to compute the Schwarzschild 
metric. 


Problem 5.19. 


(1) Use the covariant version of Eq. (3.34) to show that 
ptc pP — opto qam qe gem — geye + gge, 
(2) Use the result of part (1) to show that 
myth = —2gyE + 2998 + 2g y ages 
(3) Use the result of part (2) to show that 
GT; — LET; is a 2-vector. 


Problem 5.20. Use Eq. (5.72) and the result of Problem 5.6 to compute the 
curvature 2-form $22, for the 2-dimensional surface of the sphere. 


Problem 5.21. A metric important in the study of cosmology is the Friedman 
metric defined by the equation: 


ds? = dc? — (a(t)? [dx? + sin? y(d0? + sin? 0 d$?)]. 
Use Eq. (5.43) to show that 


L-0 T, = 27"a)a'(a), 
T, = 3y*'aa' sin? x — 3y*°a sin x cos x 
and 
E, = 3y*'aa' sin? y sin? 0 
+ 1y?*a? sin y cos y sin? 0 


— 4y%a? sin? y sin 0 cos 0. 
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Problem 5.22. Use the results of Problem 5.21 and Eq. (5.72) to show that 
for the Friedman metric, the curvature 2-forms are 


4, = —y"aa", Ro. = — "aa" sin? y, 


Ry. = — "aa" sin? y sin? 6, 
Roy = 7" (1 + (a')?)a? sint y sin? 6, 
Ry, =U + (a')?)a? sin? y sin? 0, 


Ro = "(0 + (a'))a? sin? y. 


Problem 5.23. An important class of metric tensors can be characterized by 
the relation y, = 7, — mw, w” >, where wk = n'^w,, wwen” = 0, 94% = ys DDA 
ny = (9, 9,5», and m is an arbitrary constant. A metric tensor of this form 
is said to be degenerate. 


(1) Show that g, =n, — 2mw,w,. 

(2) Show that the matrix g^ = n^ + 2mw/w* is the inverse of gx- 
(3) Show that g/?w, = n’?w, = w’. 

(4) Use Eq. (5.42) to show that 


m ô 
r= 75 y au! (wiw,). 


5.5 The Interpretation of Curvature 2-Forms as Infinitesimal 
Rotation Operators 


In the last section of Chapter 4, an index free Clifford number F was 
said to be “parallel transported" along a path u%(s) if it satisfied the 
equation 


VE =V, F =0. (5.73) 


As an index free Clifford number is parallel transported along a 
path, it will maintain a constant magnitude. An observer moving along the 
path with a parallel transported vector would observe no rotation. However, 
in general the vector would undergo a rotation with respect to the coordinate 
frame. Or perhaps one should say that the coordinate frame rotates with 
respect to the parallel transported vector. At any rate this relative motion 
may be described by a rotation operator, that is 


F (s) = Rs) FOR - !(s). (5.74) 
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This implies that 


dF (s) - (#8 


1 
)Foa (s) + Ws) F0) 92 _ 
ds d ds 











= (20) RSR) (0)22 - !(s)) 
" ] Th 
+ (Rs) 7 ORT Gat) È 
Therefore 
FO (4° a c) + F(A) mE eue) 
ds ds 
But 
-1 
9-— “(AG \R-\s)) = 22 T TRO a-i) + RUS) =. = 
So we have 








E (4° a ro - Fs) (e RS) (5.76) 
ds ds ds 


On the other hand, from Eq. (5.73) 


du* du*0F dw du* 


0= F = F +F 


ds ^ ds Ow ds * ds * 





or 





ire a du* 
dF db g 2" 





B Y I. (5.77) 
ds ds ds 
Comparing Eqs. (5.76) and (5.77), we have 
d&(s) du" 
-——DL2(s 5.78 
d d: (s). (5.78) 
For small displacements 
RAs) = (2 + As D, AS x) (5.79) 
s=0 





106 CLIFFORD ALGEBRA 


TM ut = u%(2) = u%(1) + As 





If 4 #a and) £8, then u^ = U^(1). 


Fig. 5.1. A coordinate loop ın the surface obtained by setting all coordinates equal to various 
constants except u* and u^. 


We note that 2(0) = I and so from Eq. (5.78) 
































dA(s _ du’ 0) 
(s) T0). 
ds s=0 
Also from Eq. (5.78) it follows that 
d'R(s) - (S Lyra - dL, S, 
ds? ds? ds = ds 
So 
PRO _ (=) du’ dT, du” du" 

ds? |.. ds? ds ds ds ds '" 





where everything on the right-hand side of this last equation is understood 
to be evaluated at s = 0. Equation (5.79) becomes 


du’ d?u" du” dI, du’ du” 
R(As)=I +A T, - i(A ili Jr Y4 rr, 5.80 
(As) 5 ds 2(As) ds? ds ds ds ds epe 











where again it is understood that everything is evaluated at s = O. 

Now let us consider the result of parallel transporting a Clifford number 
around a closed path. Consider the path 1 — 2 — 3 > 4 — 1 shown in Fig. 
5.1. For the first leg of our journey, 


w dr du =0 for vs a, 
ds ds 
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and 
dr, cT, 
ds ĝu" 








and 


a(l) 


Ry... = I + AsT,(1) + X(As y TO + ECOL, | (5.81) 


where the argument “1” is used to indicate the point at which the Fock- 
Ivanenko 2-vectors are evaluated. 
For the second leg of our journey, 














du? du” dr, oF 
— x 1, =0 for v x P, =Z, 
dt dt P dt ðu! 
We then have 
20,.4- I + At) + kay E s2) | (5.82) 


We wish to express this last rotation operator in terms of Fock-Ivanenko 
2-vectors evaluated at our point of origination. With this in mind, we note 
that 


T,(2) =T,(1) + As 





d¥,(1) ôr) 
= P(1 As ———. 
ds ptas Qu* 


Similar expansions can be made for 0T,(2)/0u? and I,(2)E,(2). However, we 
only need the first term in each of these two expansions to retain a 
second-order precision in 22; ., 4. With these remarks, we now have 


8.3 = I + MES) + AsAt a) ) 4 At » | a DD roro |. (5.83) 


To obtain the rotation operator for 20, 2-3, we carry out the obvious 
multiplication and retain terms up to second order. We then have 


r 
R, an3 = Ra nR, na Le AST, + AT + Ash CEE eur 
u 
T, r 
+ as E Hrg |+ Et Pen (5.84) 


where everything is evaluated at point 1. 
We could continue this procedure through point 4 and then on to point 
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1. But this gets cumbersome. Instead, one can obtain 22, 4-3 by formally 
switching As with At and a with f in the formula for 28, „3—3. The result is 


| 


eas 2 T TIE + (AD? E TEN, jr. (5.85) 


oT, 
48,5453 — I t AST, + AIT; + asa é : 
u 





To obtain the inverse of 22, „4-3, we note that (Z + ô)! = 1 — ô+ ô? 
where in our case, ó — all the garbage on the right-hand side of Eq. (5.85) 
except for J. Carrying out the arithmetic and retaining terms only up to 
second order, we have 





oT, 
40,454, = I — AST, — Atl, as 2s nx. 
ot, 
— (A5)? a 
2(As) E 


Finally to get the rotation operator for the entire closed path we have 





or 
1 = kh E — |. (5.86) 


9,525354217 B34 1B 4243 











a, or, 
= I + AsAt| — — TID 
E ôu f | 
= I + AsAtGB,,). (5.87) 


From Eq. (5.87), we see that the curvature 2-form may be interpreted 
as an infinitesimal rotation operator which acts on a vector or any Clifford 
number that undergoes parallel transport around a small closed loop formed 
by segments with coordinate directions. 

This result can be generalized to loops consisting of segments with 
noncoordinate directions. 

For example one can first parallel transport a vector from point 1 where 
u* = u*(1) to point 2 along a path defined by the equation 


ius = Wi fora=1,2,...,n 
ds 





At point 2, u*(2) = u*(1) + W?(1) As. For this first path segment, we get 


L1) 


28,5 = I + AsT(1) + (As | SO di + reor 
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ut = u%(1) + Wy (Hat 






ut = u%(1) + We (4)4s + Wy (tat 


3a 
ut = u(1) + W^ (1)4s + Wit 





u“ = v (1) ut = ut(1) + Wi (1) 


Fig. 5.2. Starting from point 1 and moving first 1n direction i and then in direction j, one arrives 
at point 3a. Starting again from point 1 and moving first in direction ; and then in direction i, 
one arrives at point 3b. 


where 


(^ 


dE(1) | du* oT; at ar; 
ds  dsów  ' ðu" 


= aT; 











The second leg of our journey is governed by the equation 


du* — j 
dt di 


For this leg, we get 


T, 
Raza = I + At (2) X ES (2) 


+ TQ) ro 
In a fashion very similar to the coordinate computation, we have 
dr; du* ôr, (1 
r =r) + As SEO = ray + as! GO 
ds ds Ou* 


— F1) + As wee 





oo D (1) + As GT (1). 


So far this computation is almost identical to that carried out for a 
coordinate loop. Now we encounter a difference. At the end of two legs 
of our journey, one arrives at a point that is labeled by 3a in Fig. 5.2. The 
coordinates of this point are 


u* = u%(2) + ALW Q2) = u*(1) + As WFC) + AtW3(1) + AsAt ô; WSCA). 
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When we switch As with At and i with j, we do not arrive at the same point. 
We arrive at the point labeled 3b in Fig. 5.2. This point has the coordinates 


u* = u*(1) + AsW#(1) + ALWS(1) + AsAt 6,W%(1). 
It thus becomes necessary to close the path by a short segment where 
Au* = AsAt(d, Wi — ô W5). 
From Eq. (5.78), 


AR = AwT, = Au* WT, 
—AsAt WEWE — WON, 
—AsAte,/'T,. 


Thus for this short segment, 
04.5 3b — I <7 AsAtc,*T,. 
With this correction, we get our desired result: 


Ry >2>3a>3b>4>1 = I t AsAt(o,T; g ôT; ES LT; T Lj; = ci Tz) 
= I + ASA). (5.88) 


6 


THE SCHWARZSCHILD METRIC VIA 
FOCK-IVANENKO 2-VECTORS 


6.1 The Use of Fock—Ivanenko 2-Vectors to Determine the Schwarzschild 
Metric 


About 10 years after his paper on special relativity, Albert Einstein published 
a series of papers establishing the general theory of relativity (1915a, 1915b, 
1916). According to his theory, the path of a small test particle is that of a 
geodesic in curved space-time. Furthermore the metric for a mass free space 
must satisfy the equation 


Rf -0 (6.1) 


where R;* is the Ricci tensor. At that time Einstein constructed some 
approximate solutions for this system which gave credence to his theory. 
Soon after this, Karl Schwarzschild found an exact solution which is of 
special interest (1916). For the study of planetary motion about the sun, one 
is interested in a time-independent spherically symmetric metric. Using 
various symmetry arguments, Schwarzschild hypothesized that the line 
element corresponding to such a metric should be of the form 


(ds)? = f(r)c^(dt)? — h(r)(dr)? — r?(d0)? — r° sin? 6(d9)?. (6.2) 


In this curved space-time, one can no longer use the same radial 
coordinate to designate the distance from the center and the circumference 
of a great circle about the origin divided by 2x. For the Schwarzschild metric, 
the radial coordinate represents the circumference of a great circle divided 
by 2r. This is easy to see. On the surface of a sphere with the center at the 
origin, the line element is 


(ds)? = —r?((d0)* + sin? 0(d9)?). 


This is of course the same as it would be in the flat Lorentz metric. The 
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circumference of the equatorial circle where 0 = 1/2 and (ds? = —r?(dó)? is 


2n 
| r d$ = 2nr. 


0 


On the other hand, to compute the radial distance between two values 
of r, we note that for dt = d@ = d$ = 0, we have 


(ds)? = —h(ry(dry.. 


Thus the radial distance from r, to r, is 
| (h(r))? dr 4 (r, — n). 


We now turn to the problem of solving Eq. (6.1) and thereby obtain 
explicit expressions for f(r) and h(r). From Eq. (6.2), 


cf o o0 0 
0 -hr 0 0 
gag — 0 0 EM 0 (6.3) 
0 0 0 —rsin?0 
and 
Mc) 0 0 0 
0 —1/h 0 0 
P se X aH 0 i a 
0 0 0 — 1/(r? sin? 0) 


From these two equations, it is clear how to construct an orthonormal 
system of Dirac matrices: 





1 1 
t= — f? = — Yn 6.5 
rag = ph (6.5) 
1 1 
r= fl = iy, 6.6 
pe p” (6.6) 
1 1 
P=- = —-73 Ye (6.7) 
r r 
1 





(6.8) 
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Since our metric is diagonal, we can use Eq. (5.43) to compute the 
Fock-Ivanenko 2-vectors. That is 


ð 


T,= p de 


a 


where the index f) is summed but « is not. From this, one gets 








2r! D 
T, =f f y= ci - $19, (6.9) 
4 4( fh)? 
T, =0, (6.10) 
pee ae (6.11) 
2 2h? 
r sin? 0 r? sin 0 cos 0 
T, =Z y? = vy? 





2 2 


_ sind oy, POR ud 


Soa 6.12 
2h? 2 oe 





To obtain the curvature 2-forms we use Eq. (5.72), that is 


ð é 
cu L-nn-rur, 
Qu* á P 


e pone. 
ia ^5 uh 


From this formula, we get 





in in [ s ge 


ra Afh)  &fh9 


= | Fos m d» (6.13) 
Afh) (fh? 





In a similar fashion 


1 


3989, = “ihr Yer (6.14) 


1 


Ry = — —-—— Yen 6.15 
2A or ifr (6.15) 


114 CLIFFORD ALGEBRA 


WK 
IRo = —— Yro» 6.16 
6 A(h)?r H 8 ( ) 
ig, = E (6.17) 
2A pr A(hyir Tor . 
ho} 
1 = 
2989, = 2hr? Veo: (6.18) 


From Problem 5.14 and Eq. (5.66), Einstein’s field equations can be 
written in the form 


YP Ron = 2R,,y? = 0. (6.19) 
Thus we have 


PR — Y' Ry +V Ro + y" Ron 
à J P Eog EE. f 
=< t 








ASh) ASh? Xfhr" xfhr" 
= 0. 
or 
f* TN. F 
B es (teas 3 = 
HR Ea Ah disp » in 


In a similar fashion: 


O PUY K 
^B um c x = 
Pw Iz Afi ax] x eg 


=f h' h—1 
i m — 
EE lx; h)r 5 2(hy?r TON fe 0, (6.22) 








and 





-f k h—1 
É = = 
D t ne J 0. (6.23) 


Subtracting Eq. (6.20) from Eq. (6.21) gives us 


E 
(h?r (nr 








=0 (6.24) 
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or 


Integrating, we get 


In(h) + In(f) = In(fh) = In(k) 


or f(r)h(r) = k. At long distances from the sun, our metric should approach 
the flat space-time Lorentz metric, so 


lim f(r) = lim h(r) = 1. 


ro roo 


Thus it is appropriate to set our constant k equal to 1 and we then have 


f(r)h() = 1. (6.25) 
From Eqs. (6.24) and (6.25), Eq. (6.22) becomes 


k h—1 K 1 


+ =0 eee 
Qr hr? SA. Se T 








From the partial fraction expansion, this becomes 


Dom E 2d 
h h-1 r ` 


Integrating this equation gives us 
—In(h) + in(h — 1) + In(r) = In C 


or 


r(h—1)_ 
rop 


C. 


Solving for h, we get 


h(r) = (1 = sy 5 (6.26) 
r 


From Eq. (6.25), we also have 


f= (: a 2) (6.27) 
r 
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The reader should also check that these formulas for f and h also satisfy 
Eqs. (6.20) and (6.21). Traditionally the constant C is designated by 2m, so 
the line element for the Schwarzschild metric becomes 


(ds)? = (: ~ aea - ( - 2”) (dr. — P[(d6)* + r? sin? 6(49)?1. 
r 
(6.28) 


In the next section, it will become evident that the constant 


where M is the mass of the sun and G is the universal gravitational 
constant. 

A singularity of sorts occurs in the metric at r = 2m. For this reason, 
2m is known as the Schwarzschild radius. In the study of black holes, one 
must consider the nature of physical phenomena for r < 2m. However, for 
the study of planetary motion this is not a matter of concern. The field 
equations used to derive the Schwarzschild metric are valid only in a mass 
free space. For the sun, the Schwarzschild radius is approximately 3 km, 
which is well within the surface of the sun which is the boundary of the mass 
free region. 

In this section, we constructed curvature 2-forms from Fock-Ivanenko 
2-vectors and then used the resulting curvature 2-forms to obtain Einstein's 
field equations. Conventional wisdom says that the most efficient method of 
obtaining curvature 2-forms for a not too complex metric is to first compute 
what are known as connection 1-forms via a "guess and check" method 
introduced by Professor C. W. Misner (1963, appendix A of paper). This 
method is used to compute the Schwarzschild metric in Wald's General 
Relativity (1984, pp. 51-52 and pp. 121-127). A similar calculation is carried 
out for the Friedman metric in Gravitation (Misner, Thorne, and Wheeler 
1973, pp. 355—357). It is my opinion that the method used in this section is 
both quicker and more straightforward. 

In the next section, we will compute the bounded orbit of a planet 
predicted by the Schwarzschild metric and compare the result with the 
classical Newtonian computation. 


Problem 6.1. Even in classical Newtonian physics, the Schwarzschild radius 
has an interesting significance. Suppose the mass of a spherical astronomical 
body (star, planet, or whatever) is so dense that the radius of the body is 
equal to 2M G/c?. Using the assumptions of Newtonian physics, show that 
the escape velocity for an object initially on the surface of such a body is 
equal to the speed of light. 


THE SCHWARZSCHILD METRIC 117 


6.2 The Precession of Perihelion for Mercury 


This section contains no applications of Clifford algebra. This section is 
included to give credence to the claim that there is some significance to the 
Schwarzschild metric that was computed in the last section. Readers who 
have a cursory knowledge of general relativity may wish to skip this section. 

For those who insist on not skipping this section, we will compare the 
equations of motion for a planet according to the general theory of relativity 
with the equations of motion according to Newtonian physics. 

In classical analytic mechanics, the path of a particle is that of the “least 
action” which is determined by the calculus of variation equation: 


du* du’ 
a |) imag SE vat a,u) [dt = 0. 


In the case of planetary motion in spherical coordinates, we have 


oll) <2) (ont te m 


To cast this classical equation in a form which will make it comparable with 
that of Einstein, it is useful to replace t by s — ct. If we also factor out a 
factor of 4mc*, Eq. (6.29) becomes 


» || ev + ety + esi od + ME | as =o, (6.30) 


c?r 


where it should be understood that the dot above the r, 0, and @ indicates 
a differentiation with respect to s. 
The Euler-Lagrange equations for Eq. (6.30) are 


d oL OL 
dsóü* ôu" 








where L designates the integrand of Eq. (6.30) and u’, u?, u? represent 
respectively r, 0, and ġ. 
The Euler-Lagrange equations for 0 and ¢ are 


d 7 ; 
Pa (2r?0) = 2r? sin 0 cos 0($? (6.31) 
s 
and 


4 ar iad. (6.32) 
ds 
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Fig. 6.1. The rate at which the shaded area 1s swept out by the radius vector is the areal velocity. 
The fact that the areal velocity is constant for planetary motion about the sun 1s Kepler’s second 
law. 


It is well known that a particle acted on by a central force stays in a 
plane. We can choose this plane to be 0 = 72/2. This solves Eq. (6.31). 
Equation (6.32) then becomes 


don 
qu $) —0 
or 


17h = a/c. (6.33) 


The constant «y is chosen so that it has a standard interpretation. In 
particular 


= 1,2 
Xo = of E 


To interpret this, we note that 3r? dọ represents an infinitesimal area swept 
out by the radius vector from the sun to the planet. (See Fig. 6.1.) Thus x, 
is the areal velocity—that is the rate at which this area is swept out. The 
fact that the areal velocity is constant is Kepler's second law. 

Rather than examine the Euler-Lagrange equation for r, it is easier to 
take advantage of the fact that s does not appear explicitly in the integrand 
of Eq. (6.30). This means that 


ou 


(: aL ) 
u^ — — L | is constant, 


and this gives us our constant energy equation: 


(?Y? + A(xg)?/(c?r?) — 2MG/(c?r) = 2E/(mc?) 
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or 


(P? = 2bE/(mc?) + 2MG/(c?r) — 4(%)?/(c?r?), (6.34) 


where E is a negative constant representing the total Newtonian mechanical 
energy of the system and m is the mass of the planet. 
Equation (6.34) is more tractable if we substitute 


(6.35) 


Furthermore, the resulting equation is easier to solve for u as a function of 
$ rather than as a function of s or t. From Eq. (6.35) 


1-1 de 
pes (6.36) 
From Eq. (6.33), 
È = 2ag/(cr?) = 2a9(u)?/c. (6.37) 


Using Eqs. (6.35), (6.36), and (6.37), Eq. (6.34) becomes 


du\? E MG 
= + — 2 = P R 6.38 
($) 2ma2 2o dna ce) SR) 





The polynomial P.(u) is of course a parabola. Since the left-hand side 
of Eq. (6.38) must be positive, the physically meaningful values of u are those 
for which P,(u) is positive. (See Fig. 6.2.) 

If we designate the two roots of the polynomial by uy4x and uym then 


Pelu) = (Umax — uu — umm). (6.39) 
Using this form for P.(u), we can rewrite Eq. (6.38) to obtain 


n +du 
Ni (Umax — u)(u — umn) 








dọ (6.40) 


Suppose the planet moves in the direction of increasing ¢ and suppose 
that we start the measurement of $ from the position of perihelion where 
r = fyn and u = uyax. From that position, u decreases while $ increases. 
Thus we must choose the negative sign in Eq. (6.40) so 


kaal E (641) 


UMAX ad Chins ae x)(x EX umn) 
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Fig. 6.2. For physically meaningful values of u, the parabola P.(u) = (Umax — u)(u — umn) must 
be non-negative. 


By completing the square, the integrand can be reformulated so that 











$= -Í ie (6.42) 
wx AJ (c[b?)* — (x — a/b?) 
where 
c 2 Umax — UMIN 
b? 2 
and 


a _ Umax + UMN 


b? 2 





Carrying out the integration, we have 


eee (= — a) / (a) 
arccos| | x n 2) tesa 
$ = arc cos (u — 5) / (3)] (6.43) 


1 a c 
u CEDE DS E BUS ó. (6.44) 





and thus 


or 
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P 


Fig. 6.3. The ellipse x?/a? + y?/b? = 1 with focal points at F; and Fy. 


This is the equation for an ellipse where r is the distance from one of the 
focal points, a is the magnitude of the semi-major axis, b is the magnitude 
of the semi-minor axis, c is half the distance between the focal points, and 
is the angle swept out by the radius vector from the position of perihelion. 
(See Figs. 6.3 and 6.4 and Problem 6.2.) 

The fact that, according to Newtonian physics, the orbit of a planet is 
represented by an ellipse with the sun at one of the focal points is Kepler's 
first law. 

Equation (6.43) is valid only between the initial perihelion point and the 
first aphelion point encountered by the planet. At aphelion r = fmax and 
u = uy = (a — c)/b?, so at that point 


$ — arc ed C98) ] = arc cos( — 1) = m. 


Immediately after the point of aphelion, both du and dó are positive. The 
equation for $ then becomes 








p=r+ | 2 : 
umm «/(¢/b?)? — (x — a/b”)? 


F1 
2a ud 


Fig. 6.4. The ellipse 1/r = a/b? + (c/b?) cos $. This 1s the same ellipse as shown in Fig. 6.3. 
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Or 
go =m — arc ses (u — FAI + arc cos( — 1). 
Thus 
a C 
ob = 2x — arc es (u = 5) | (l 
or 


1 a c a c 
ur OE doy. ne. 


Therefore Eq. (6.44) remains valid for the entire orbit even though Eq. (6.43) 
is valid for only one half the orbit. 

It is also possible to get a simple formula for the period of revolution 
t. The formula is 


t? = (4n7a3)/(MG). (6.45) 
To derive Eq. (6.45), we first note that a) represents the areal velocity so that 


dA 


o=— 


dt 


where A is the area swept out by the radius vector from the sun to the planet. 
Integrating over one period of revolution, we get the area of the ellipse, that is 


A = nab = MT. (6.46) 


To get our desired result from Eq. (6.46), we need a formula for a. This 
can be obtained by combining Eqs. (6.38) and (6.39). This gives us 


( u)(u — umn) E + “s u 
ü Em B 
MAX MIN 2ma2 2a2 





Equating the constant coefficients of u, we have 


E. MG 
Umax T UMN = 75 - 
202 


But from Problem 6.2, uyax + Uyin = 2a/b?. Thus 


MG MGP 


(647) 
2(Umax + Umm) 4a 


x2 = 
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Thus, from Eqs. (6.46) and (6.47) 








and this is our desired result. 

The relation that the square of the period of each planet is proportional 
to the cube of the semi-major axis is Kepler's third law. 

The fact that Newton could derive Kepler's three laws from the simple 
inverse square force law was a great triumph for Newton's theory of gravity. 
Einstein's theory of general relativity adds only a very minuscule correction 
to these results. 

We now turn to the problem of determining the planetary orbits 
predicted by Einstein’s theory. In Einstein's theory, the planetary orbits are 
geodesics of the Schwarzschild metric. Perhaps the quickest way to obtain 
the equations for the geodesics is to get the Euler-Lagrange equations for 
the calculus of variation problem 


ô IC — - c(t)? — ( ~ -) (y. — £*(0)* — r° sin? i ds — 0. 
(6.48) 


(See Prob. 6.3.) 
As in Eq. (6.30), the dot above the t, r, and @ in Eq. (6.48) indicates 
differentiation with respect to s. Three of the four Euler-Lagrange equations 


are as follows: 
d 2m\. 
I 7) | =0 (6.49) 
ds r 


d i A 
m (r?0) = r° sin 0 cos 0(¢)*, (6.50) 
s 





Se sin? 09) = 0. (6.51) 


We note that Eqs. (6.50) and (6.51) are identical to their classical 
counterparts. In solving the Euler-Lagrange equations, we again pick out 
the plane for which 0 = x/2. We then have, as before: 


Se =0 and iP = a/c. (6.52) 
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Furthermore, one can integrate Eq. (6.49) to get 
2mY. 
( = x): Ed (6.53) 
r 


where k is understood to be a constant. 

In solving the Newtonian system of equations, we got Eq. (6.34) by 
taking advantage of the fact that s did not appear explicitly in the integrand 
of Eq. (6.30). We could do the same thing here, but it is easier to obtain the 
same result by simply noting that from Eq. (6.28), 


(s) =1= ( -— aer — ( — T (y — ry. — r? sin olh). 
ds r r 
(6.54) 


This simplifies when we set 6 = 1/2 and 6 — 0. In addition, Eqs. (6.52) 
and (6.53) can be used to eliminate $ and f from Eq. (6.54). The result is 


2m  4o$ | 8maj 


2,2 2,3" 


(y = —(1 — kc?) + — 
r cr cr 


(6.55) 





Comparing Eq. (6.55) with its Newtonian counterpart Eq. (6.34), we see 
that if we identify m with MG/c?, the two equations are virtually identical 
except for the last term on the right-hand side of Eq. (6.55). 

As before, we substitute r — 1/u and change the independent variable 
from s to $. Following the previous calculation, we have 


s E. 1\du,_ &o \ du 
ud (5) (i) ag (5) 


and Eq. (6.55) becomes 
(s) co EK eu" " mc? 
dọ 4(o) 2(uo)* 
= P,(u). (6.56) 








(u)? + 2m(u)? 


= 


For a planetary orbit, P,(u) must not only be nonnegative but u must 
range between two roots of Pp(u) which are labeled by umn and uyax in Fig. 
6.5. 

Writing P,(u) in terms of these two roots give us 


Pelu) = (Umax — uu — Uy )(A + Bu). (6.57) 





Multiplying out the right-hand side of Eq. (6.57) and comparing the 
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Fig. 6.5. The cubic polynomial Pp(u) = (Uyax — uu — ummn)(A + Bu) must be nonnegative for 
physically meaningful values of u. For a planetary orbit, u must range between uy and uyAx. 


coefficients of (u)? and (u)? with those in Eq. (6.56), it becomes clear that 


B = —2m (6.58) 
and 


4 
A = Tong + tmn) = 1 — ur (6.59) 


Equations (6.57), (6.58), and (6.59) can now be used to modify Eq. (6.56). 
If @ is measured as before from the position of perihelion, then Eq. (6.56) 
rewritten in integral form becomes 


b= | : i . (6.60) 
UMAX (imax — x)(x — uy (CL — (4ma/ b?) — 2mx 








Expanding the integrand of Eq. (6.60) as a power series in m and 
retaining only first-order terms results in the relation, 








ds -l (1 + 2ma/b?) + mx) dx 


. (6.61) 
HMAX V (max — x)(x — umn) 


With this approximation, it is possible to carry out the integration in 
much the same way as the corresponding integral was done for the 
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Newtonian orbit. By completing the square for the quadratic under the 
square root sign and regrouping the terms in the numerator, Eq. (6.61) 
becomes 





(6.62) 





ns | [(1 + 3ma/b?) + m(x — a/b?)] dx 
wx — XKclby-(x-—alby 


Carrying out the integration, we have 


3ma u — a/b? 4 
b= (1 + m) arc cos( “= 2") + [max — uu — umn) ]?. (6.63) 


At the first aphelion, u = uy = (a — c)/b? and 


3 3 
d= (1 + m) arc cos(—1) = «(i t m) 


When the planet attains the position of perihelion again 


3ma 
ó = (i T m) . 
Therefore over each period there is an advance of perihelion by the amount 
ma 


OG = DNs 


Noting that m = MG/c? and 1? = 4n?a3/MG, we have 


Ad 3(MG)P 1 


T co gb 





(6.64) 


For the case of Mercury, Eq. (6.64) translates into 43.03" per century. 
Actually the gravitational forces due to other planets cause Mercury to 
precess at a rate far greater than this. However, during the middle of the 
nineteenth century, it became clear that the gravitational forces due to the 
known planets could not completely account for the precession of Mercury. 

At the time, the natural explanation was that there existed a still 
undiscovered planet between Mercury and the sun. Anomalous behavior in 
the orbit of Uranus had also been observed. The British astronomer John 
Couch Adams and the French astronomer Urbain Jean Joseph Leverrier 
independently made very lengthy computations to predict the position of 
the then undiscovered planet Neptune. Leverrier sent his results to the 
German astronomer Johann Gottfried Galle of the Berlin Observatory. With 
this information, Galle was able to locate Neptune on September 23, 1846. 
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When Leverrier turned his attention to the advance in the periheliion 
of Mercury, he found that he could account for the unexplained portion of 
the advance by hypothesizing the existence of a planet between Mercury and 
the sun. This theoretical planet became known as Vulcan and many attempts 
were made to find it. These attempts all ended in failure. In 1898, the 
American astronomer Simon Newcomb published a paper in which he 
calculated the residual advance unaccounted by the known planets to be 
41.24" 4- 2.09" per century (Newcomb 1898). More accurate estimates of this 
residual advance have been made in recent years, but they remain consistent 
with the value of 43.03" predicted by Einstein’s theory (1915c). 


Problem 6.2. A point on an ellipse can be characterized by the fact that its 
distance to one focal point plus its distance to the other focal point is equal 
to some constant which is the magnitude of the major axis. Referring to Figs. 
6.3 and 6.4, it is clear from the law of cosines that 


(r' = r? + (2c)? + 4cr cos o. 
(1) Using the fact that r' + r = 2a, show that 


a 


b? 


1 c 

-= — 7750608 Ọ. 

r b? ? 
(From Fig. 6.3, it should be clear that b? = a? — c?) 

(2) From Figs. 6.3 and 6.4, it follows that ru = a — c and max = a + c. 
Use this to show that 








Umax + Umin _ 4 did Umax — Ug _ C 
2 b? 2 b? 


where u = 1/r and uyax = 1/rmn While umn = l/ryax. 


B B 
i-| as= | Lda 
A A 


where L = (g,5u*ü^)* and u* = du*/d4. 
The problem of determining the equation for a geodesic path is 
equivalent to solving the calculus of variations problem ôI = 0. 


Problem 6.3. Suppose 


(1) Show that the Euler-Lagrange equations for this problem can be 
put in the form 





a n v 
= x a ) pua e (6.65) 
da" dà 
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(2) Using the fact that 


du* M dA du* » du* 


ds = L då Or [o = : 
dài dsdi ds 





show that 


d?u* du" du” 
+1," = 
ds? ds ds 





(6.66) 


(3) Suppose I = fa [L?] ds. Consider the problem of solving the equa- 
tion ôI =0. Show that the Euler-Lagrange equations for this 


problem also result in the system of equations for a geodesic. (Eq. 
(6.66.) 


7 
TWO DIFFERENTIAL OPERATORS 


7.1 The Exterior Derivative d and the Codifferential Operator 6 Related to 
the Operator V = ;/V, 


The operator that appears in Dirac's equation for the electron is V = y/V,. 
Let us examine what happens when this operator is applied to a p-vector in 
a space that may be curved. Suppose our p-vector is written in the form 


F = T NS da ode (7.1) 


! 


"rne 


where the components of ¥ are assumed to be totally antisymmetric with 
respect to their indices. Then using a modified form of Eq. (5.20), we have 


1 


VF = VF = p Iusta Pros (7.2) 


Now let us consider the product y“y/'/2-: 4», To get a handle on this 
product, suppose we were dealing with an orthonormal system of Dirac 
matrices. We would then have 


jns» — $595 n pr, 
If the index k is distinct from all the j,’s then this product is the (p + 1)-vector 
gun jp. 


On the other hand, suppose k = j,. In that case, the product becomes a 
(p — 1)-vector. In particular 


P ea PESADE pep qe opis 
= (—1) 1n jfi m" fir. PP, 
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where #/r denotes the fact that $7" is missing from its original position in the 
product. Summarizing the possibilities, one has 


p ^ 
jhjnjs jp — jus» + X (—1) 7 !nr$ni. Jr sod (7.3) 


r=1 
where j, denotes a missing index. 


Equation (7.3) easily generalizes to any system of Dirac matrices. (See 
Problem 7.1.) In general one has 


: p : à . 
yhnn- Jp — ju eB Y (—1)y 1g yn urs» (7.4) 


r=1 


Inserting this result into Eq. (7.2) gives us 


"T 
VF = yi Posse? Jp 
Ps r-i kjen JJ2- Jr J 
to 2 CoD Pascua eene (7.5) 
But 
PP = -DT E uh RR d 
35 (ey e Je eJpik! 
Thus 
VF = J F kyij2- jp 
di ~ pl j2. Jmk? 
E k juae ` 
+ p! 2 F Aja..Jr-. pik? MTS (7.6) 


A 


If we relabel the index sequence jija. -. jr- . -jpo by Jija ---jp-1, it becomes 
clear that the sum on the right-hand side of Eq. (7.6) consists of p identical 
terms. Thus we finally have 


1 


a nykj1J2+ Jp 
p! I -Jp k? 


VF = 


1 


+ (p— Di " Di PP a DS E jp-1. (7.7) 


We now split V¥ into two pieces—a (p + 1)-vector and a (p — 1)-vector. 
This decomposition now enables us to define the exterior derivative d and 
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the codifferential operator 6. In particular 


1 


dF = = | DAPI po (7.8) 


and 


1 k 
DiE n deck? 


Gr Jija.. Jp-1 


1 


Soe ee .Jp-1? (7.9) 
where in both cases 
1 NS 
F =—F yizo de, 


" Jij2 Jp 


It should be noted that the sign for the exterior derivative d is well 
established and we have followed that convention. On the other hand, the 
sign convention for the codifferential operator 6 is not so well established. 
For the case of a positive definite metric, our sign convention is identical to 
that used by Heinrich W. Guggenheimer (1977, p. 329). It is opposite to the 
sign used by Harley Flanders (1963, pp. 136-137). 

In coordinate frames, both Eq. (7.8) and (7.9) can be cast in more useful 
forms. Let us first consider Eq. (7.8): 


1 
= fif2.. B 
dF = nt Fapa <p aY" iun * 


1/0 p 
=—(—F — I$"F, apıP2. RE: (710 
p! (2 B:B2...Bp 2. B Ppifa . a (7.10) 
where the index y appears in the sequence p,p... fj, in the kth slot vacated 
by fx. It should be apparent that, because of the symmetry of the indices, 


none of the terms under the summation symbol make any contribution to 
dZ. For example, by switching the position of « and f,, we have 


Dye -Bp — ae ac Bp 


If we then reverse the labels of the dummy indices « and f,, we have 


ie “Bp — Tog a -Bp 


and thus 


4 yes Bp — — Typ, ty io Bp — 0. 
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In a similar fashion, 
T5, ye he ofi = 0 fork =1,2,...,p (7.11) 


Thus Eq. (7.10) becomes 


1/0 
M" e Fg, nha P (7.12) 
where 
F = i Pn Y Be, 


In the formalism of differential forms the analogous equation would be 
written in the form 


6 
dF = es Fitts) du? du”! du’... dub (7.13) 
p! \du 
where 


1 
F = p! Fy8;... fp du^: du^? ern du^», 


Now let us turn to Eq. (7.9). This equation is more difficult to deal with 
but it too can be cast in a simpler form. We first note that 





1 upip2 =I 
= E EE PED, EN. 
= ( i 1)! E "arde pna cm Tu EIE TI 
p- 1 u 


p 
+ 2 MO ad td tete a, Baias (7.14) 


It should be observed that L,,/* F??:/:- -/»-: = 0 for essentially the same 
reason that Eq. (7.11) was shown to be valid. 

We are now forced to consider the nature of the contracted Christoffel 
symbol T,“ 





g” (gon c Oda e) 
I, Te + , 
fou (25s ðu” — Qw* 
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and thus 


a av Gay 


an 


(7.15) 


This can be simplied further. If g = det(g,,), then 


g = Yay X (the cofactor of g,,) + terms not involving ga. 





Therefore 
Q ð 1 . ô 
99 _ Gav x (the cofactor of gą) + terms not involving OG 
ou" ĝu” ou" 


To be more is 


Ei | = y|% x (the cofactor of «|. 
u ay 


Anyone who has computed the inverse of a matrix should know that the 
cofactor of g,, is g^'g. Therefore 


0g Gay 
DELE g^g Gay 
Qu" Qu" 


and Eq. (7.15) becomes 


1 ôg 1 dlg| 1 6 
T" = = = JJ lgl- 7.16 
"  2g0u" 2l|g| Ou" /|g| Ou" ei VAR) 





With this result, Eq. (7.14) becomes 


1 We 
oF = ( (Vgl Fh y 182...Bp-1 OAD) 
(p — D! /igi \eu SEEN 





where as before 
F = l prao..p 
= al YBiB2.. fp: 


A simple equation follows almost immediately from Eq. (7.12); that is 
ddF = 0. (7.18) 


This equation is labeled the Poincaré lemma by some authors and the 
converse of the Poincaré lemma by others. To verify this equation, we note 
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that from Eq. (7.12), it follows that 
1 @& 





gr y YBif2.. Bp 
ndr IE 
Since 
e e 
du” du" du" du” 
and 


yP B2 Bp Z —ynfifa ; Bp 


it immediately follows that dd¥ = 0. 

If F is a p-vector such that d¥ = 0, then F is said to be closed. If 
there exists a (p — 1)-vector 4 such that F =d@, F is said to be exact. 
Obviously if F is exact then it is also closed (Poincaré’s lemma). However, 
the converse of Poincaré’s lemma is true for any region that can be 
continuously deformed to a point but it may not be true globally (Flanders 
1963, pp. 27-29). 

Examples of closed 1-vectors are upper index coordinate Dirac matrices. 
To verify this, we note that 


dy? = y ^ V," Er —yt A T^y" = -Ln/»y" = 0, 


since L^ = I. 
Locally any of these upper index coordinate Dirac matrices are exact 


since 
ys = »PVyu* = du’. (7.19) 


On the other hand, we may not be able to extend the coordinate u* over the 
entire space. Consider for example, the use of polar coordinates for R?. In 
that case y? is not defined at r = 0. It is true that y? = d0, but 0 cannot be 
extended in a continuous manner to cover R? even if the point where r = 0 
is omitted. (See Fig. 7.1.) 

A theorem analogous to the Poincaré lemma also applies to the 
codifferential operator ð; that is 


SF = 0. (7.20) 


To verify Eq. (7.20); we will first demonstrate a relation between 6 and d 
that can be expressed simply in terms of the normalized pseudo-scalar J. 

The n-vector J can be expressed most simply in terms of orthonormal 
Dirac matrices. In particular 


J = 9192... Pn = 512 ant (7.21) 


Since the space of n-vectors is 1-dimensional, Eq. (7.21) defines J uniquely. 
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It should be clear that 
with J if n is even. This is true because ?; commutes with itself and 


anticommutes with each of the remaining (n — 1) 


^5 


;S that are used to form 


7; commutes with J if n is odd and anticommutes 


2 a e 
5 N N 
< = = 
a 

o 
D 

3 
S 
5 

g 
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o 
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3 

o 
S 

3 = 
E = 
; ; 

“ E = 
x F ii 
bN ~ 
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D lI 
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v 2 

a 
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aeo 
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2E 
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To show this is true, we note that 


= —nJ + JIX. 


ag Ts + JI, 


VJ => 


From Eq. (7.22), it is clear that J commutes with any p-vector of even order. 


th T, and thus V,J 


Wi 


Since T, is a 2-vector, J must commute 
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Taken together, Eqs. (7.22) and (7.23) imply that 
JVJ£ —JV)JE = (1P ^ 10yv. (7.24) 
Now it is not difficult to show that 
(JY e (DC Dem, (7.25) 


where s is the number of — f's on the diagonal of the signature matrix. (See 
Problem 7.3.) When this result is applied to Eq. (7.24), we get 


IVIF -(—1y*(—1ye*o2yg 


and therefore, 
UF + 8F —(—1y*!(—1)y0*92(J0JZe + JJF). (7.26) 


Now if F is a p-vector, then JF is an (n — p)-vector, dJF is an 
(n — p + 1)-vector, and finally JdJF is an n — (n — p + 1) or(p — 1)-vector. 
A similar walk through the computation of J6J4 reveals that it is a 
(p + 1)-vector. If we now equate the (p + 1)-vectors and the (p — 1)-vectors 
that appear on the two sides of Eq. (7.26), we have 


dF —(—1y*!(—1y"*U2JàJge, (7.27) 
and 


SF —-(—1y*!(—1y*»2JgJz. (7.28) 
From Eqs. (7.28) and (7.25), we now have 
SSF = (JdJ)(JdJ) £ = Jd(J)?dJ.£ = (—1y(—1)"-92Jdq(J.7 ) = 0. 


In this fashion, we have verified Eq. (7.20). 

In closing this section, 1 would like to draw your attention to two items 
if you are already familiar with the formalism of differential forms. One: in 
section 3.3 of Chapter 3, we noted that the upper index Dirac matrices are 
the Clifford algebra analogues of the 1-forms that appear in the formalism 
of differential forms. Now that we have introduced the exterior derivative d, 
this point arises again. This is particularly true when we compare Eq. (7.12) 
with Eq. (7.13). It also crops up in Eq. (7.19). Second: multiplication by the 
normalized pseudoscalar J is essentially identical to applying the Hodge star 
operator (Flanders 1963, pp. 15-17). Except for some differences in sign 
conventions, the two entities considered as operators are identical. However, 
for computational purposes it is much easier to use the operator J in the 
context of Clifford algebra than it is to use the Hodge star operator in the 
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context of differential forms. We will take advantage of this to compute the 
Kerr metric in Chapter 9. 


Problem 7.1. Multiply both sides of Eq. (7.3) by WW, ^ WP... W, ^» to 
show that 


ptpPiBa---Bp ybi Bo 4 Y (— 1E 1g Pila ss Bess Be 
k=1 


(This should convince you that Eq. (7.4) is also valid for noncoordinate 
bases. The proof of Eq. (7.4) in its full generality is awkward only because 
of the limitations of the notation chosen for this text. Mathematicians need 
a few more alphabets to do their thing.) 


Problem 7.2. Show that if F is a p-vector, then Ft —(—1)»-U22. 
Problem 7.3. Show that (J)? = (— 1)(—1)""~ 2T, where s is the number of 
—1’s on the diagonal of the signature matrix. (Suggestion: compute JJ‘ and 


use the result of Problem 7.2.) 


Problem 7.4. Suppose that ¥ is a twice differentiable Clifford number. 


(1) Show that VPF = (dë + 8d) F. 
(2) Show that if F is a p-vector then V?¥ is also a p-vector. 


Problem 7.5. Show that if F is a differentiable p-vector and @ is any 
differentiable Clifford number, then 


dF ^ -—(d)^d4--(—1)"Z a dG. (7.29) 


7.2  Maxwell's Equations in Flat Space 


In the units of Heaviside- Lorentz, Maxwell's equations are as follows: 


V-B - 0, (7.30) 
COB > b 
E E (1.31) 
c ôt 

VE =p, (7.32) 
WDE uw lb 
a Ve Baas, (7.33) 
c Ot c 


Furthermore, if @/ét is applied to both sides of Eq. (7.32) and V* is applied 
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to both sides of Eq. (7.33) and the two resulting equations are added, one 
obtains 
OD =e 


EV -0. (7.34) 
at 


A related equation is the equation of motion for a small test particle 
due to the Lorentz force. That is 











mc? ut (qE +a x B), (7.35) 
ds 
where 
gotas ( v,/C v,/c v,/c ) 
did JA — (olo JA — lo JA - (loi 


and q is the charge of the test particle. 
To formulate these equations in terms of Clifford algebra, we first 
introduce a coordinate system of Dirac matrices. Let 


X^ghos Ux. RS tee 


(Yo)? = = (7)? = — (7)? = —(y3)? =I; 
and 


Yale +Ve%a=9 fora z f. 


It is also useful to introduce the Faraday 2-vector F = }3F,,)"" which is 
defined by the relation 


0 E, E, E, 
Ej 5 0 5 8, dis 
T dem. Heo Se^ ege l 


-E, -B, B 0 


With this definition, it is not difficult to show that Eqs. (7.30) and (7.31) can 
be combined into the single equation 


dF =0. (7.37) 


The signs for the Faraday 2-vector depend on the signature of the metric 
tensor. Whichever signature is used, the sign is generally chosen so that the 
equation for the Lorentz force can be incorporated into an equation written 
in the form 


mc?^a* = qF" guf, (7.38) 
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where m is the mass of the particle and q is the charge of the test 
particle. 

The tensor character of F,, serves to underline the intimate connection 
between the magnetic and electric fields. An observer who is stationary with 
respect to a system of charges which are stationary with respect to one 
another will observe an electric field only. However, an observer in another 
coordinate system which is moving with respect to the first will observe both 
a magnetic and an electric field. 

To compute the codifferential of #, we note that from Eq. (7.17) for 
flat space, 





ô 
62 = F® jy, 7.39 
(s »s ( ) 


To examine the detailed structure of Eq. (7.39), we need the contravariant 
components of the Faraday tensor. These can be computed from Eq. (7.36). 
The result is 


Ü ES -E, -E 


y z 
E, 0 -B, B, 

[F*] = : (7.40) 
E B, 0 -—B, 
E, -B, B, 0 


From Eqs. (7.39) and (7.40) one gets 


TES (S E ôE, n e) 2 (m à ôB, — 2) 
ôx | Oy @z ^d c ôt Oy Oz " 


à (B i: >.) x (m n ôB, r3 
côt ôx Oz m c ôt ôx oy "e 











Using Eqs. (7.32) and (7.33), this becomes 
J. J J, 
iF = (on +y hys £t). 
c c é 
Thus it is natural to introduce the current I-vector 
J. J J, 
cd y put hot du ys. (7.41) 


With the introduction of this current 1-form, Eqs. (7.32) and (7.33) can be 
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combined into the single equation 
SF =I. (7.42) 
Writing out Maxwell's equations in terms of vector components requires 
eight equations. Using vector notation, it is possible to reduce the number 
of equations to four. We see that using the formalism of differential forms, 
only two equations are necessary (Eqs. (7.37) and (7.42)). However, using 


the language of Clifford algebra, it is possible to condense the entire system 
into a single equation. Since VF = dF + 6, we have 


VF = f. (7.43) 


Since d¥ = 0, we know from the converse of Poincaré's lemma that 
there exists a vector potential J such that at least locally 


F =de. (7.44) 


Writing out the components of Eq. (7.44), we have 


OA 0A 0A 0A 
F =dd = RES) »«( o z) 2 
e ox? 7 Ox? ox? 


0Ag OCA; 30 0A, OA, Te 
«C dy i ôx? ôx? I 


0A, 2) si (26 2) m 
tlm + | = - 
(4 ôx! ? Ox! Ox? j 


= —E,y'® pe Epy”? E E,y?? 
= yes s B,y** Dt B,y'?. 


Since 
E = 0A m" 
E= —V¢ —-— and B=Vx A, 
c ôt 
we see that 
A = A,y* = by — Aa! — Ap, — AY, (7.45) 
or 


A = A*y, = yo + Axy, + Ayy2 + Azys. (7.46) 
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(Regardless of the signature used, the signs in the equations are adjusted so 
that (Al, A’, A?) = (Ax Ay, A,).) 
Summarizing our equations, we have 


F =a, (7.47) 
dF = dd = 0, (7.48) 
dF = 5d = f. (7.49) 


It is sometimes useful to divide electromagnetic phenomena into two 
categories—those which are caused by free charges and free currents and 
those which are caused by bound charges and bound currents: 


I = J FREE T A sounn (7.50) 
and 
F = F’ + M. (7.51) 
It is understood here that 
A rage = OF" (7.52) 
and 
J BOUND = 6M. (7.53) 
If F' = Fy, ,, then 
0 -D —D, —D, 
D, 0 —H, H, 
[F] = (7.54) 
D H, 0 —H, 
D, —H, H, 0 


In addition, if M = 1M*^y,,, then 


Br R P XE 


€ -P 0 -M, M, uon 
^ | -P M 0 -M 


y z x 


-B -M, M, 0 


In standard books on electricity and magnetism, Eq. (7.51) appears as 
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two equations, namely 
E-D-P (7.56) 
and 
B=H+M. (7.57) 


Problem 7.6. Since 7 = 64, it immediately follows that 6,7 = 0. Compare 
this result with Eq. (7.34). 


Problem 7.7. The equation ôd. = J is usually solved for «7 with the 
subsidiary Lorentz condition that 


Show that this is equivalent to the condition that 6. = 0. Also show that 
with the Lorentz condition, Eq. (7.49), can be rewritten in the form 
Vi = f. 


Problem 7.8. 


(1) Show that the Lorentz force Eq. (7.38) can be rewritten in the form 


q 


ü(s) = V,u(s) = ma 





(F (s)u(s) — uls) F (s)), (7.58) 


where F is the Faraday 2-vector. 
(2) Since the world velocity vector does not change length, it undergoes 
a generalized rotation or Lorentz transformation. That is 


u(s) = L(s)u(0) LH t(s). (7.59) 
Use Eqs. (7.58) and (7.59) to show that 


q 


Vee) = 2mc? 





(s) (s). (7.60) 


(if you get stuck, you may wish to review the derivation of Eq. (5.78).) 
(3) Suppose F (s) = F (0). Show 





L(s) = epis 7) (7.61) 
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(4) Suppose F (s) = F (0) and the electric field E = 0. Show 


sqB . (| sqB \2 jB B, 
(s) =I cos sin +2 y1 12) (7.62 
(s) (2) Ea B t B , B d (62) 


where B = (B-BÈ. 





Problem 7.9. 
(1) Show that 


J-E J xB 
HFS — $F) = ro( 2 ) + X oh gx). 
(2) Show that 
(FyF),;= FI — IF . 
(3) Suppose T? = (PFY F). Show that T* = T”, (If you wish to 


cheat, look at Problem 3.18.) 
(3) Show 


tim 3004 -IF Yo = V, UFI — 49». 


(This last result combined with the result of part (1) in effect shows 
that 


TE | B) 
T9* , = — and Puegl( a 
c 


for i= 1, 2, and 3. The tensor T* is known as the electromagnetic- 
momentum tensor.) 


Problem 7.10. 
(1) Use Eq. (7.49) to show that 


AP, — AM e JI 


(2) Use the result of Problem 5.18 and the Lorentz condition to show 
that in flat space A. , = J*. 


Problem 7.11. 
(1) Consider the Lagrangian 


L = ámc^g,4X^x? + qx*A,, 
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where x = dx*/ds. The corresponding Euler-Lagrange equations are 


d dL 


——-=0 forA420,1,2, or 3. 
ds 0x* 





Show that these equations are the component equations for the 
Lorentz force; that is 


mc? (X" + T,,"x*x9) = qF*,x*. (7.63) 


(Comparing Eq. (7.63) with (7.38), one might notice an apparent 
discrepancy. If a’ = X", then the equations would be at odds. 
However, 


a = ay, = VAŠ Ya) = X*y, + XXIV G Ya = Xy, + KOT", 


Thus a” = X* + I5,"x*x?. 
(2) Show that the corresponding Hamiltonian is 


H =x" T — L = im?u’u, = imc?. (7.64) 
x 


73 Is Gravity a Yang-Mills Field? 


Those attempting to construct a unified field theory look for similarities in 
the equations which characterize the four fundamental forces of nature; the 
electromagnetic, the weak, the strong and the gravitational. The gravitational 
force seems to be unique in that unlike the other forces it is not considered 
to be a Yang-Mills field. Nonetheless, in the formalism of Clifford algebra, 
there are very strong similarities. These similarities are so strong that one is 
tempted to claim that gravity is a Yang- Mills field. 

What is a Yang-Mills field? For the strong and weak forces, the 
components of the vector potential used in electromagnetic theory are 
replaced by matrices referred to as connections or components of a gauge 
potential. In addition, the Faraday tensor F,, is replaced by a field strength 
tensor. In particular 


A, ôA, 
"7 gxt ox" 





+ ig[A,, Ag], (7.65) 


where 


[An Ap] = AgAg — AgAs- 


Of course for the electromagnetic case, [A,, Ag] = 0. 
In this context, one introduces a covariant derivative V,. Acting on a 
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vector field @, we have 


Ó i 
Vu = ? t igA,@. 
ôx” 





On the other hand, when V, acts on some matrix field M, we have 
6 : 
VM = Er M c ig[A,, M]. (7.66) 
x 


Using Eqs. (7.65) and (7.66), one can prove a Bianchi identity: 
V,E,, + V,F,, + V,F,, = 0. (7.67) 


Note: Sign conventions vary, but for the case of the electromagnetic field 
applied to Dirac's equation for the electron, one writes 


ô 


Va = 
y ox" 





ie 
+— AW. 
V ^s y 


We have chosen a sign convention that appears most compatible with this 
special case of a Yang- Mills field. 
In contrast to Eq. (7.66), some textbook authors define 


V, = X + igA,. (7.68) 
ox" 


Without additional multiplication conventions, this definition is incompat- 
ible with the Bianchi identity. (Also see Problem 7.12.) 
For the electromagnetic case, Eq. (7.67) reduces to 


ð ô 0 
FE, Fa + 
axe " Ox" ox” 








E, = 0. (7.69) 


Equation (7.69) is equivalent to Eq. (7.37) which is also known as the 
homogeneous set of Maxwell’s equations. 

Generally the equation for the gauge field A, or the field strength tensor 
in terms of a source term J^ is derived by carrying out a variation on the 
action 


S= [t-te + J*A,)./ —g dx? dx! dx? dx? 


= | ricco, + J*A,),/—g dx? dx! dx? dx? — (7.70) 
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where Tr(M) is the trace of the matrix M. We should note that J* is generally 


constructed from interacting fields. 
To carry out the variation, we see that 


6S = fr 1(0F,,)g"gP F,, + F,5g"gP"ÓF,, 











+ J*6A,)./ —g dx? dx! dx? dx? (7.71) 
where 
0A 0A, 
Fg = P a 
ôx” — Ox 
and 


ÒF, = 91 by - 9, A : 


+ igót0 A,A, + ig 05A,0A, 
When these terms are plugged back into Eq. (7.71) they must be sorted 


out with some care since the matrices do not commute. The first term that 
occurs in Eq. (7.71) is 


fr (- ions )o ang Fw — Jr dx! dx? dx? 
= In(-; ( ; Foy x) dx? dx! dx? dx?. 
x 


4 





Integrating by parts, this becomes 
1 ô 0 0 1 2 3 
oes Tr(O AF”, / — g) dx? dx! dx* dx 
x 
1 ô 
+ T In(ou ae (F*. / =0) dx? dx! dx? dx?. (7.72) 
X 


The first integral is zero if we assume 6A, = 0 on the boundary of our region 
of integration. Since F” = — F**, we can rewrite the second integral in the 
form 


- Jn(o^. 4 (F9. / =0) dx? dx! dx? dx?. 
x 
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Another term that occurs in Eq. (7.71) is 
1 j 
* | teins Sua) / —g dx? dx! dx? dx? 
= -i [| ric tonar. / — g) dx? dx! dx? dx?. 


Since Tr(ABC) = Tr(CAB) = Tr(BCA), this last term can be rewritten in the 
form 


1 
E | nioasiar". / —gA,) dx? dx! dx? dx? 
1 
= 2 [ tHeAsGar™ / —gA,) dx? dx! dx? dx?. 


When all the terms in Eq. (7.71) are dealt with in a similar manner and 
like terms are collected, we get 





ôS = — | (64 s (F**. / — g)  iqA,(F*,/ — 9) 
Eu iq(F**, / —g)A, = =a) dx? dx! dx? dx?. 
This implies that 
SF = 9) iA FI ST) — aE” /—0) 4, = Ia 


or 
1 
—— V,(F*./—g) = J’. (7.73) 
vV Tg 
In the electromagnetic case, this becomes 


1 ô 
(Fs. / —g) = J°. 
-g Ox* 


According to Eqs. (7.9) and (7.17), this is equivalent to the equation 





Fe = Jo. (7.74) 
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For the electromagnetic case, J^ must satisfy a continuity equation. 
That is 


ec es e 


Neri Ê =a") 


F” /—9). 





f= = Ox ox 


This last term equals zero because of the symmetry of the dummy indices. 
Therefore for the electromagnetic case 


1 à 
—— — (Vy -gJ*) = 0. (7.75) 
Jogan" 


For the more general Yang-Mills equation, we can show that 





1 
V —gJ*) = 0. (7.76) 


To prove Eq. (7.76), we first note that it can be shown that 
OAg _ 


ax” ox? a? )u 


0A, ôA 
—igM| — P 4 ig[A,, A ) 
iq (24 - m iq[ p] 


= iq[ E54, M]. (7.77) 








(V,V, — VjV)M = iq (n 








To prove Eq. (7.77), we merely apply Eq. (7.66) twice and then discover that 
a lot of terms cancel out. In turn, Eq. (7.77) implies that 


(V, V; — V, Vf F*^) = iqLF,s, fF") 

= igf(FipF* — FE) 

ig f (Fg, g, F” Pe F*9F s) 
ig EP ha — FE) 

- 0. 


II 


Because of the symmetry of the dummy indices, this implies that 


V, Vi f F*^) = 0. (7.78) 
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A special case of this result occurs when f = ./—g. Thus 


VG -gF^") = 0 


or 





NET | T «7r = V,(./—gJ*) = 0. 


In the usual presentation, Einstein’s field equations have a quite different 
appearance. Elsewhere in this text, we only deal with the vacuum equations: 


Rap = 9. (7.79) 
For the more general situation, we have 
Rag = ZRgag = 811, (7.80) 


where Tag is known as the energy-momentum tensor. 

Because of the nature of the left-hand side of Eq. (7.80), it is required 
that T" , — 0. This is usually regarded as the analogue of the electro- 
magnetic equation J?., = 0. 

It is important to note that when T,, = 0, R^, — 2Róf = R — 2R = 0 so 
R = 0 and Eq. (7.80) reduces to the vacuum equations. 

Einstein at one time considered a slightly more general form. That is 


Rag = ZRgag + Agag = 821,5 (7.81) 


where A is known as the cosmological constant. From time to time, 
observations have been made that suggest that the cosmological constant is 
not zero but that it is the subject of an ongoing debate. 

The format of Eq. (7.80) or (7.81) has a quite different appearance than 
Eq. (7.73). This suggests that the gravitational field is not a Yang- Mills field. 
But let us look at the situation more closely. 

First of all the Fock-Ivanenko 2-vectors were introduced by Fock and 
Ivanenko to make Dirac's equation compatible with the dictates of general 
relativity. In this context, they may be regarded as components of a gauge 
field. (See Problem 8.4) 

Secondly, for a coordinate frame, we have 


ô 
VM =>; M -T,M + MT, (7.82) 
X 
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and 





ð ô 
T, Lr 7.83 
ax? B ex^ a m, 51 ( ) 


320,5 EX 
where By, = 3R,g,,)"". 
Thus we can identify —T, with igA, in Eq. (7.66) and 122, with —iqF,, 
in Eq. (7.65). From Eq. (7.67), we then have 


VR yy +V Ba + V, Ray = 0. 


It is interesting to note that this corresponds to the usual Bianchi identity 
in general relativity. It is not too difficult to show that 


VR nv = 3R uvogia * + HRvopl an” F Rl y. (7.84) 


If we sum over a cyclic permutation to the indices y, v, and o, we find that 
the terms generated by the second term in Eq. (7.84) cancel out. Thus 


V, Buy + VB va + V Ban = HR opi s TI addis + Raus)? * 
=0. 


Formally we could derive an equation of the form 


1 
—— Vv - gidt?) = P’ 


by a variation of I, in the action 
S= [oana + FT, )on/ —g dx? dx! dx? dx3. — (7.85) 


However, in the context of general relativity the independent field is 
considered to be the metric tensor g,, and not the gauge field T,. It is true 
that T, depends not only on the metric tensor g,, but also on the choice of 
underlying orthonormal frame (Vierbein). On the other hand the curvature 
2-form and therefore the action of Eq. (7.85) depends only on the metric 
tensor. Thus it is not clear to me that it makes sense to carry out a variation 
of this action with respect to I. 

Perhaps one could consider the possibility of permitting generalized 
versions of T, and then carrying out a variation of the action with respect 
to F, and g,, independently. An approach similar to this has been applied 
to a somewhat different action. A discussion of this procedure for the Palatini 
action is carried out in Robert M. Wald's General Relativity (1984, pp. 
450—459). (If you refer to Wald's text, you should be cautioned that the 
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operator V, that Wald uses is quite different from the version used in this 
text.) 

The nature of these relationships or nonrelationships leaves the status 
of gravity as a Yang-Mills field subject to debate. Nonetheless we can 
show how Einstein's equations (or a slight generalization) can be cast in the 
form of Eq. (7.73). We first note that the derivation of Eq. (7.78) applies to 
42%”, This implies that 


5 v(- 4 ( 5 arare) - 


which means that if we define 





1 
F = = VLR") 


Ja 


then ,Z* may be considered a “conserved current" in the sense that 


1 
—— VAV 799) = 0 
v Tg 
Consider 


VG f RP) = 3V S RP, ym 


0f a v a v a v a v 
= GL R = y" T fR np —fR P vl poly” EE fR P wT ge re) 


0 
~4 5 Re, F fR? v.g — SR E o Tp m uer) 
x’ 


Since 
p 0 8 6 
RY yg = RUSSE RU RT! — RSS! — RIS, 


we can write 
ð 
VES R!) = (Z f GRE + SR” mip 


— SR? Ty — SR” mTpo )r (7.86) 


Because of the symmetries of the dummy indices 


R” Tg" ax 0. 
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Furthermore 


"ORRE: SE 
To vat ct 





Thus Eq. (7.86) becomes 


1 ô à 
VGS RI) = iU RY gt (35 - = 24 Ja )R n). (1.87) 


Because of the Bianchi identity 


aß — _ RÆ — R2 
R SOR viu R 


Bni y 


= es F Rs 


"vf 


Thus R^. , = 0 when R*, = 0. 
Therefore we can require our “current” to be zero for the vacuum state 


if we set f = ./—g. With these results, we have 


1 





diia Velo) IR”) = GR, y". (7.88) 


i 


From Eq. (7.81), we have 
R*, — 5R6% + A? = 8nT*, 
and thus 
R—2R + 4A = 82T 
or 
R= 4A — 82T 
where T = T”, Thus 
R*, = Ad§ + 81(T*, — 576%). 


Therefore 





Eo dg 
E] 
R 
QU 


E3 
R 
is 
< 
so 
* 
* 


I 
AR 
a 

A 
d 


Tor) yt 
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To summarize: 


Q 


ox* 


V I= 





L 1-LD J+0 IW. (7.89) 


where [ ] represents any Clifford number; 


0 0 





IR g = F T, L, T 7.90 
2*^* af ax B ax a [ a 3] ( ) 
V,Biy + VR, + Vj Ray = 0 (7.91) 


1 


Jg 





VG gR”) = IR", gy" 


= ZR” 4 
= aR oy 
=f" 
= 4n(T*, — ATÒ), oy (7.92) 
and 
1 
J a= E VAV —9F") = 0. (7.93) 
=g 


These last five equations are all characteristic of a Yang-Mills theory. 
Equation (7.92) in the form 


JR*, y" = An(T*, — FTS," (7.94) 


may be considered a slightly generalized form of Einstein's equations with 
the possibility of allowing a nonzero cosmological constant. 

Generally we solve Einstein's equations for the metric tensor. However, 
it is conceivable to this author that one could solve Einstein's equations for 
the Fock-Ivanenko coefficients and then use Eq. (5.42) to solve for the metric 
tensor. In practice this probably would not be an easier method to find 
solutions. However, the theoretical possibility of such a route would lend 
credence to the suggestion that Einstein's theory of gravitation can be 
considered to be a Yang- Mills theory. 
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Problem 7.12. Assume 


V = (2; + A, Jy 
ex" 
y = My 

(sti) 

V = | — + igA, Jy’ 

ox" 
V,My = (V,M)y + MV,y. 
Show 


ô : 
VM x Sat + iq[ A,, M]. 


Problem 7.13. Confirm Eq. (7.77). 


Problem 7.14. Show R, = 2RP, ,. Hint: apply the Bianchi identity to RE 
and then contract the result on two indices. (This implies that the energy- 
momentum tensor in Eq. (7.80) must satisfy the relation T”, g790) 


74 The Migma Chamber of Bogdan Maglich 


The intent of this section is to walk the reader through an exercise which 
demonstrates both the advantages of using a coordinate system with the 
symmetries of the problem and the pitfalls of using a nonCartesian coordin- 
ate system. There is no Clifford algebra contained in this section. 

One scheme to produce fusion energy has been advocated by Prof. 
Bogdan Maglich (Macek and Maglić 1970; Maglić et al. 1971; Maglich 1973; 
Gordon and Johnson 1974) In this scheme deuterium or some other 
appropriate kind of ions are set on self-colliding orbits confined by a 
cylindrically symmetric magnetic field. Any student who has studied one 
year of college physics knows that for a cyclotron where B, — B, — 0 and 
B, is constant, charged particles have circular orbits if they are set in motion 
in the xy-plane. 

In the median plane of Prof. Maglich's migma chamber near the axis of 
symmetry, B, = By(1 — ko?) where p? = x? + y?, k > 0, (1 — ko?) > 0, and 
it is hoped the ions will be confined to a region where this is a good 
approximation. In such a field, the orbit of a charged particle will have a 
shorter radius of curvature near the axis of symmetry where the magnetic 
field is strongest and a longer radius of curvature away from the axis of 
symmetry where the magnetic field is weaker. For this reason, the orbits are 
no longer closed paths. Thus a crucial question arises. Will a charged particle, 
which passes through the axis of symmetry once, repeatedly return to this 
point where it has a chance to collide with other particles following similar 
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paths or will it eventually spiral out and away from the axis of symmetry? 
This problem can be dealt with in cylindrical coordinates. We note that 





GA, Ag, 
E,—F4,————— —0 for x* = p, 0, and z. 
KT eae dut ái 
0A, 0A, 
B.-—E.-— z 4 i 7.95 
4 "E 00 êz a) 
0A 6A 





& 
ll 
S 
ll 


2 a 7.96 
K Oz 0 a 


0Ag OA 
a Rr age 
p 


(7.97) 


Since the electric field E — 0, there is no need to introduce a non-zero 
Ay nor a time dependence in A,, Ay, or A,. Because of the cylindrical 
symmetry, there is no reason to introduce any 0 dependency in the vector 
potential. Furthermore, because of the same symmetry, B, — 0. Thus Eqs. 
(7.95)- (7.97) becomes 


OA, 
=—, 7.98 
P az ( ) 
B, = EL = 0, 
oz 
B, = cede (7.99) 
op 


Thus, the only component of the vector potential Z which is necessarily 
nonzero is A,(p, z). To solve Eq. (7.99), it might be thought that B, should 
be the same in cylindrical coordinates as it is in Cartesian coordinates. 
However, * B," is a very poor notation for what is actually a component of 
the second-rank Faraday tensor. In Cartesian coordinates for our problem, 
Fio = Foo = Fao = F;3 = F3, = 0, and Fy, =F, = —F,, = —Be(1 kp’). 
(Actually for z 4 0, neither F,, nor F4, would be zero.) At any rate, in the 
xy-plane 








Ox ô Qy Ox 
Foo = Fy + yx z 
op 00 6p 00 
= F,,(cos 0Y(p cos 0) + F,,(sin 0)( — p sin 0), 
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Or 
E, = pF, = —Bo(p — kp?). (7.100) 
We also know that 
Fg = z Y 
so 
A= a, 6) = Ae (1 ue) (7.101) 
2 4 2 2 


Actually to guarantee confinement in the z-direction, one should assume 
some z dependence in the function A,. At any rate, for the cylindrically 
symmetric field, 


L = me [ei — (Y. — (96)? — Gy] + q0Aq(, 2). — (7102) 


Since L does not explicitly depend on t or 0, two constants of motion are 


2 = me? (ct) = p,c? (7.103) 
c ôt 
and 
E = —mc^p?ü + qA, = —psc?. (7.104) 


We also know that since 
(ds)? = c?(dt)? — (dp)? — p^(d0)* — (dz)’, 
it follows that 
c2(t)? — 9? — p?6? — z? = 1. (7.105) 
Combining Eqs. (7.103)- (7.105), we get 


A?p? — (cpg + q4) 
(mc*)*p? 





prm ; (7.106) 


where 


A? = ((p,)?c* — m?c*) = (mc?yw? = (mev)?/(1 — (v/c)?). (7407) 
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Bg > 0, p, <0, k=0 


Fig. 7.2. Circular orbit of a positive ion in a uniform magnetic field when the constant 
generalized angular momentum p, = mp?0 — qA,/c? is negative. 


In the case for which z = 0 and k = 0, Eqs. (7.104) and (7.106) become 





A B 
and 
aisi qi 2 
y = p^ — (c Po + qAo) (7.109) 


(mc?)p? 


If By > 0, we have three cases. (See Figs. 7.2, 7.3, and 7.4.) 





By>0, p, 20, k=0 


Fig. 7.3. Circular orbit of a positive ion in a uniform magnetic field when the generalized angular 
momentum ps 1s Zero. 
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By > 0, p, > 0, k=0 


Fig. 7.4. Circular orbit of a positive ion in a uniform magnetic field when the generalized angular 
momentum pọ Is positive. 


If p, < 0, then from Eq. (7.108), Ê remains negative throughout the orbit. 
The coordinate p has two critical points when 6 = 0. At all points in the 
orbit, the magnetic field has the same sign so the curvature never reverses 
sign. This means that at these critical points p reverses its sign and p attains 
a maximum or minimum value. According to Eq. (7.109), there are four 
values of p which make 6 = 0. But two of them are negative and have no 
physical meaning. The two remaining roots are 





ATA A—A^ 
À ) and ju det 


PMax = 
qBo qBo 


where 
A — x A? T 2qBopec?. (7.110) 


(See Fig. 7.2.) — 
If p, = 0, 0 again remains negative throughout the orbit. In this case 


2 B 2 
(meh (gy? = 4? -TE pa, 
The value of ù is zero at only one point of the orbit. This occurs when 


2A 
P = Pmax = oou 
qB, 


The minimum value of p is 0. At that point p is discontinuous. In particular, 
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the value of p jumps in value from — A/mc? to + A/mc? as the particle passes 


through the origin. (See Fig. 7.3.) 
If py > 0, then from Eq. (7.108), 0 < 0 for 


2 2 
p> Jépoc" 
Bo 
while 6 > 0 for 


2p,c? 
Bo 





p< 


As in the first case, the coordinate p has two turning points where p = 0. 
They are 


: A-a 
PMax = 41B, PMIN qB, > 








where A is defined by Eq. (7.110). 
In all three cases, the orbit of the particle is a circle with radius 


pede: Hati (7.111) 


q4Bo  qBy /1— (v/c)- 


The choice of the cylindrical coordinate system would be a very poor choice 
if our goal was to show that these orbits are circles. This can be done far 
more easily in Cartesian coordinates. (See Problem 7.15.) The advantage of 
using the cylindrical coordinate system is that it is now possible to see what 
happens when B, (in Cartesian coordinates) is By(1 — kp) instead of Bo. In 
that case A, is —4B)p7(1 — 1kp?) instead of —4Byp?. This has the effect of 
replacing By in Eqs. (7.108) and (7.109) by By(1 — 4kp7). 

Since k is small, this causes only a slight shift in the turning points of 
p in each of the three cases discussed above. It is still necessary for p, to be 
zero in order for the particle to pass through the origin. 

The most significant change is that the paths are no longer circles. Since 
the radius of curvature is inversely proportional to the strength of the 
magnetic field, the path of a particle will straighten out somewhat when the 
particle recedes from the axis of symmetry. Thus a sort of precession of 
the orbit will occur. (See Figs. 7.5, 7.6, and 7.7.) Although the orbits are no 
longer circles, it is clear that these orbits remain bounded. Furthermore, 
because of the precession of these orbits, particles with positive values of p, 
will tend to collide head-on with particles with negative values of pọ. 
Therefore if a large number of ions can be set in orbits that pass near the 
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y 


Bp» 0, Pa <9, k»0 


Fig. 7.5. Self-colliding orbit of positive 10n in a nonuniform Maglich magnetic field when the 
generalized momentum p, is negative. 


axis of symmetry, one can reasonably expect to get some collisions and fusion 
events. 


Problem 7.15. It is shown in virtually every Freshman level college physics 
course that without relativistic considerations the orbits of ions in planes 
perpendicular to a uniform magnetic field are circles. It can be shown that 
this fact remains true according to the laws of special relativity. 

Suppose E — 0 and B = (B,, B,, B,) = (0, 0, Bo). Consider a charged 


Bo > 0, Pa 7 0, k>O 


Fig. 7.6. Self-colliding orbit of positive ion in a nonuniform Maglich magnetic field when the 
generalized momentum pg Is zero. 
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y 


By > 0, pg > 0, k <0 


Fig. 7.7. Self-colliding orbit of positive 1on in a nonuniform Maglich magnetic field when the 
generalized momentum p, is positive. 


particle with initial world velocity u(0) = (u°(0), u'(0), u?(0), 0), where 


v(0) 


(oL wi@= 
JA — (v/c)* c./1 — (v/c)? 








and 
v,(0) 


e/l- (v/c)? 


(1) Use the result of part (4) of Problem 7.8 to compute u(s). 

(2) Integrate both sides of the equation x(s) = u(s) to obtain a formula 
for x(s) in terms of x(0) and u(0). 

(3) Show that 


u?(0) = 


(mcv)? 


(qBo)*(1 — (v/c?) 





(x!(s) — a)? + (x*(s) - b? = 
and 


a = xi(0) eee 2B, * 12(0), and b=x?(0)— 2B, Le 


7.55 "The Generalized Stoke's Theorem 


In this section I will use both the terminology and notation of differential 
forms. In Eq. 7.19, it was observed that an upper index coordinate Dirac 
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matrix arises when we apply the exterior derivative d, to a coordinate. That is 
y* = y? ^ Vax" = dx*. (7.112) 


In the formalism of differential forms, dx” is a /-form and 


1 
€ = > Oz de dx^ A dx"? ^A... ^ dx” 
p! 


1 
-—6G ap dx** dx”... dx?» 


! q192... 


is a p-form. 

In the formalism of differential forms, there is a meaningful distinction 
between p-vectors and p-forms. However, this distinction will not have any 
impact on the particular computations carried out in this section. 

The generalized Stoke's theorem is 


| ix =| F (7.113) 
y oV 


where F isa (p — 1)-form, dF is a p-form, and GV is a (p — 1)-dimensional 
boundary of the p-dimensional volume V. For p — 2, this is the usual Stoke's 
theorem. For p — 3, this is generally known as Gauss's theorem. Essentially 
what the theorem says is that in any dimension there are special cases for 
which one can convert a volume integral to a surface integral. 

Because a surface has an orientation, some care must be taken in the 
evaluation of a surface integral. This section is devoted to proving the 
generalized Stoke's theorem for volumes that have shapes that are easy to 
deal with. In the process of carrying out the proof it is also our hope to 
make the surface integral on the right-hand side of Eq. (7.113) computation- 
ally meaningful. 

To underline possible pitfalls, let us consider an example of what the 
theorem does not say. Suppose 


F = F, dx! + F,dx?. (7.114) 
Then 
OF. OF 
dF = — dx! dx? + — dx? dx!. (7.115) 
ôx! ôx? 


So far, so good. 
Now let us try to compute fy d¥ where V is the 2-dimensional region 
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xhax(x?) while the path BAD 1s 


Fig. 7.8. The path BCD is defined by the equation x! 


defined by the equation x! = x(x’). 


shown in Figs. 7.8 and 7.9. Our initial inclination might be to write 


F 
es dx? dx! 
Ox? 


dx! dx? + | 
V 


ôF, 
ox} 


pue 


2 
E ^as) dx? 


ôx! 


©) F (xt, 


1 
MAX 
1 
2 
Xu) 


‘| 


2 
*MAX 
2 
XMIN 


‘| 


2 
i 4x") dx!, 


ox? 


MAX Max) OF (xt, 
+ 
Suit xan?) 





x! 





XMIN 








Fig. 7.9. The path ABC is defined by the equation x? = x2j (x!) while the path ADC is defined 


by the equation x? = xg4x(x!). 
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or 


| dF = | By Gduax(x2), x?) dx — | 0 FG (x2), x?) dx? 
y : 3 


XMIN XMIN 


«| Y E xl, x (x!) dx! — | "P E Gl, xda(x!) dx. (1.116) 


MIN XMIN 


(You should refer to Fig. 7.8 to obtain an understanding of the notation 
used for the first two integrals on the right-hand side of Eq. (7.116). You 
should also refer to Fig. 7.9 to get a handle on the notation used for the last 
two integrals of the right-hand side of Eq. (7.116).) 

What we now have is four path integrals: 


| av - [rae [raea [n dx! - [5 dx 
V 
D D 
D C 
Je ad ved 
B A B aal Ne B 
that is 


faz = [5 dx? + |^ dx!. (7.117) 
eo oO 


Something here is clearly amiss! On the right-hand side of Eq. (7.117), we 
have one counterclockwise path integral and one clockwise path integral. 
This does not look like Stoke's theorem. The usual Stoke's theorem tells us 


that 
OF, ÖF 
fx dx! + F, dx? = s — es) dx! dx?. 


e e 


Thus it appears that we should have first written Eq. (7.115) in the 
form 


dF = (s — 2m) dx! dx?. 


Ox! | Ox? 


Then we should formally replace the 1-forms dx! and dx? by the scalar 
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differentials dx! and dx? under the integration symbol. We would then 


have 
[as = [(05- 25) ant a = [naf ena = | F. 
v ôx? ôx av 


e e 


We now turn to the problem of constructing a set of sign conventions 
which make the generalized version of Stoke's theorem meaningful. It is 
generally not too difficult to get a handle on a volume integral if the shape 
is not too horrendous. However, a boundary integral is another matter. In 
the example just discussed, our boundary integral was a counterclockwise 
integral. But what does “counterclockwise” mean in higher dimensions? 
Most of us have a pretty good idea of the orientation of a 2-dimensional 
boundary to a 3-dimensional solid. However, beyond three dimensions, my 
intuition fails—at least in this context. 

Our approach in this section is to first codify the meaning of a volume 
integral. This will then define what is meant by 


f az. 
y 


For simply shaped volumes, we will carry out the obvious partial integrations 
indicated and thereby determine what is really meant by the boundary 


integral 
| F. 
av 


What we then have is not a theorem but a workable definition of a boundary 
integral. 

For the integral of an n-form, there is only one possible definition that 
is both plausible and simple. Suppose 


1 
F = — Faa an AX" UX”... dx^ 


ni a1a2 


= F2 adx! dx? ... dx”. 


To compute the integral of 4, one formally replaces the n-form 
dx! dx?...dx" by the product of scalar differentials dx! dx?... dx" and 
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writes 


| z-| Fə .dx!dx?...dx" 
V V 


-| id Es x UM (Fay, dx!) x)... dx", (7.118) 


MIN MIN MIN 


where it is understood that 


k k k+1 k+2 
XMAX(MIN) = XMAx(MIN X o X", X) fork =1,2,...,n—1. 


(To make things easy for ourselves, we will consider only volumes such that 
we can use a single coordinate system. That is we don't have to patch together 
different coordinate systems to cover the entire volume. Furthermore, we 
will assume that the shape of V with respect to our chosen coordinate system 
is such that if one considers curves obtained by freezing n — 1 of the xs 
and letting the remaining coordinate vary, then any one of those curves will 
either cross the surface of the volume at exactly two points, touch the surface 
tangently at one point, or miss the volume entirely.) 

For a p-form, the situation becomes considerably more complex. Here 
we would have 


1 
| sal Faaa.. a, D dh, dx, (7.119) 
y pi Jy 


On the face of it, this could be the sum of (5 distinct integrals. However, 


p 
we will assume that the p-dimensional volume V can be parameterized with 


p parameters ul,u?,...,u?. That is, for any point in the p-dimensional 
volume, 


x* = x'(ul u?,...,uy) fora — 1,2,...,n. 
In this situation, 


ai a2 ô 
dx“ dx”... dx% = DES ON ] M du^: du^? .. . du^, 
Qu^: Qu^ ` Ouf» 





and Eq. (7.119) becomes 


1 


[> -31 F5,5;...5, du^ du’... du”? 
p! 


=| Fis. F^,.., du! du... du”, (7.120) 
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where 


Ox" Ox? x 
F1... fy =F, 


meee 7421 
A182.. Gp eu Qu? Qu» ( ) 


Dropping the prime, we now define the integral to be the resuit of 
formally replacing the p-form du! du*...du? by the product of scalar 
differentials du! du? .. . du" and then carrying out the indicated integration; 
that is 


| s- f F,5,.., du! du? ... du? 
V V 


=| «m “(| ^" (fa. du) ue)... du’, (7122) 


p 2 
MIN MIN MIN 


where it is understood that 


UA X(MIN) = uhaxmmn (^ * T, MPP? ..., uP) for p= 1,2,...,p. 
Now let us turn to the problem of evaluating the integral |, dF, where 
V is a p-dimensional volume. Much as we did before, we will assume that 
the coordinates of the volume and its boundary can be parameterized in 
terms of p variables. That is, for any point in the volume or on its boundary 


x* = x%(ui,u?,...,u?) fora=1,2,...,n. 


Since we are concerned with the (p — 1)-form ¥ only on the boundary of 
V, we can use the same p variables to describe F. That is, we can write 


p ~ 
F = } Fr..g...pdu' du?...du*... du’, (7.123) 


where the caps over the index k and du* indicate missing entities. Using this 
notation, we have 


F5, 4... du* du! du? . . . du? .. . du? 
p ð 
= Y (DU LE, oí du! du? ... du"... du”. (7.124) 
u 


Now from our previous definitions, we have 


p 
| dF => cure fo [fra end! du2...du? (7.125) 
y k=1 u 
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Singling out a particular value of k in the summation in Eq. (7.125) and 
integrating over u*, one has 


Ó 
| | Mina. i... du! du? .. du? 
u 
= ]: [frr k. pleas eos Ukaxa U”) 


70 
. 1 42 k 14,42 k 
— Fi, f, ple, ^oc umo s UP) | du du^... du"... du”, 


where 


=) 


k — yk 1.232 p 
UMAX(MIN) = UMaxeuNK(U 5 55. ssl ). 


Thus if we require that 


p PA 
| # =| X Fia, edut du.. hs du = | dz, 
av ô v 


y k=1 


then we are essentially forced to define 
| FQ g ,du dw^...du*...du? 
ev 


-(-10*! | ` [rF UM U^ ves UMax + o UP) 
— Fia. g. (Ut, u?,..., ss uPy] du! du? ... duh... du^. (7.126) 
This is the main result of this section. 
Problem 7.16. Show that Eq. (7.121) can be rewritten in the form 


j _1 F O(x^!, x77, 0.2, x7") 
12. p ^ ,^*a1a2...Xp 1 2 * 
p! O(u!, u^,..., uP) 





where 


O(x^!, x77, ..., x”) 
O(u!, u?,..., uP) 





is the Jacobian of the coordinate transformation. 


Problem 7.17. Show that the integral of a p-form as defined by Eq. (7.122) 
is independent of the choice of variables used to parameterize the volume V. 
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Problem 7.18. In vector notation, Gauss's theorem is 


| (FA) ds = | V-Fay, 
V V 


^ 


where # is the unit vector perpendicular to the surface in the outward 
direction. Show that this theorem is encompassed in the generalized Stoke's 
theorem if we make the identifications F, = F,,, F, = —F,4 = Fj, and 
F, = F,,. (Aside from checking the various sign conventions, one also needs 
to verify at least informally that 


| Fn, ds = [ross z) ys z) dy dz — | [peso z) y z) dy dz 
y 


and similar relations for |, F,n, ds and fy Fn, ds.) 


Problem 7.19. Apply the generalized Stoke's theorem twice to show that 
fov F = 0. (What does this say about 00V?) 


8 
DIRAC'S EQUATION FOR THE ELECTRON 


8.1 Currents and Dipoles in Curved Space Resulting from Dirac’s 
Equation for the Electron 


P. A. M. Dirac (1928) introduced what are now known as Dirac matrices 
to formulate his equation for the wave function y of an electron: 


(te i £ A) = —mcy. (8.1) 


10x" c 


It is understood that y"A, = o is the 4-vector electromagnetic potential 
encountered in Section 7.2. 

The usual approach is to introduce specific 4 x 4 matrix representations 
for the Clifford numbers and then look for four component column vector 
solutions. The disadvantage to this approach is that the components of these 
vector wave functions depend on the matrix representation chosen for the 
Clifford numbers. This means that the components do not lend themselves 
to physical interpretation. 

Soon after Dirac’s paper appeared, several physicists presented formula- 
tions which did not depend on specific representations of the Dirac matrices. 
These include Eddington 1928; Proca 1930; Sauter 1930; Franz 1935. In 
the second volume of the 1939 edition of his text Atombau und Spektrallinien, 
Arnold Sommerfeld devoted over 100 pages to the application of Clifford 
algebra to the Dirac equation. This is focused on solutions written in terms 
of Clifford numbers. Nonetheless, this approach did not attract a large group 
of users at that time. 

However, almost 20 years later, when discussing solutions of Dirac’s 
equation for the Coulomb potential, Paul C. Martin and Roy J. Glauber 
(1958) found that from a computational point of view it is useful to rely on 
the algebraic properties of the Dirac matrices rather than on any specific 
matrix representation. 

In recent years, physicists have developed a bunch of algebraic tricks 
and computational shortcuts in the manipulation of Dirac matrices for 
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Minkowski 4-space. Many of these are summarized in a paper by Alberto 
Sirlin (1981). 

To obtain solutions of Eq. (8.1) in terms of Clifford numbers, it is clear 
that we need to deal with linear combinations of p-vectors with complex 
coefficients. This requires some new definitions. 

To obtain the complex conjugate of a Clifford number .«Z, we take 
the complex conjugate of each coefficient of each product of Dirac matrices 
that appears in the sum representing 2. That is, if 


A — IA + y, A' ag YA” + yi AU + Yi, AP", (8.2) 
then 
A = IA + y A! + yg) + yu AS F Jig, AU, (8.3) 


where Aï- denotes the complex conjugate of 4^5» 

It is also useful to define the complex reverse s/t of a Clifford number. 
The complex reverse of a Clifford number . is obtained by taking the 
complex conjugate of Z and then reversing the order of all Dirac matrices 
that appear in a-product. Thus if Z is represented by Eq. (8.2), then 


At — IA + y; A + y, AU * yy; A F Yenc (8.4) 


If 4t = of then we will say that Z is c-Hermitian. If 4t = 471} then 
we will describe Z as being c-unitary. 

Since the complex reverse reduces to the ordinary reverse for real Clifford 
numbers we can use the same notation to define a complex scalar product. 
One simply writes 


LA, BY — (AB), (8.5) 


where as before (.«75 22), designates the 0-vector component of 4+2. 

Most of the work done on the Dirac equation has been done for the 
flat space metric of special relativity. However, Vladimir A. Fock and 
Dimitrii Ivanenko (1929) pointed out that Dirac’s equation could be adjusted 
for curved space. They did this by introducing what have since become 
known as Fock-Ivanenko coefficients. These were discussed in Chapter 5 of 
this book as Fock-Ivanenko 2-vectors. With the Fock—Ivanenko adjust- 
ment, Dirac's equation for curved space becomes 


(ones ^) = my. (8.6) 


From Eq. (5.44), we know that 


Q 
-sy ]-WL 1-I 15 (8.7) 
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where any differentiable Clifford number can be placed in the square 
brackets. Using this equation, one can rewrite Eq. (8.6) in the form 


imc 
TV — YB, = VE V. (8.8) 


where 


ie 
28, — Y, — E A,. (8.9) 


From these equations it is not too difficult to construct a covariant 
current. If one takes the complex reverse of Eq. (8.6), one gets 


(Vj — anh = me y. (8.10) 


Since T, is a 2-vector with real coefficients, it is clear from Eq. (8.9) that 


B= -By (8.11) 
If we substitute this into Eq. (8.10), we have 
(V! yr + art Sn yt (8.12) 


If we now multiply this equation from the right by v and multiply Eq. (8.8) 
from the left by y^ and then sum the two resulting equations, we get 


(Vj yr + yy vu + 1,7) — Wy WB, = 0. (8.13) 


In general the 0-vector component of the product of any two Clifford 
numbers is identical to the 0-vector component of the product of the same 
two Clifford numbers when the order of the product is reversed. This means 
that if we project out the 0-vector component of each term in Eq. (8.13), the 
projection of the last two terms will cancel one another out and we obtain 
the relation 


QV, ) yy + WV) = 0. (8.14) 


For an index free Clifford number such as y, we know that the covariant 
derivative y,, is equal to V,y. Furthermore y^.,--0. Thus Eq. (8.14) 
becomes 


(y? uy + wy" Ww + Wye ndo = 0 
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or 
(WII); no = 0. (8.15) 
Since taking the covariant derivative of a p-vector is a p-vector, one may 


compute the covariant divergence of yy" before or after projecting out the 
0-vector. Therefore Eq. (8.15) can also be written as 


From Eq. (8.16), it is clear that if we define 


J" = e(J ur), (8.17) 


then J" may be interpreted as a conserved current; that is J" satisfies the 
continuity equation: 


J" , = 0. (8.18) 
Suppose we again use the fact that one may exchange the order of the 


product of two Clifford numbers before projecting out the 0-vector compo- 
nent of the product without changing the result. Then 


Wyo = Qi) = QV» 
or 
J" = eQr wp"). (8.19) 
Since <y”, y,» = 6%, it is possible to expand the product yyt in terms of its 


p-vector components and simply read off the four components of J" from 
the 1-vector components of yy. That is 


epy? = IA + yoJ9 +y, J! + y4J?  y4J? + p-vectors of higher order. 
(8.20) 


Using what is known as the Gordon decomposition it is also possible 
to interpret the 2-vector components of yt (Gordon 1928). 
Closely following W. Gordon's computation we first note that 
yy = Aah) + 36). (8.21) 
From Eq. (8.8), we have 


ih 
y = (Va — y Wa.) 
mc 
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or 
ih. : 
y = —Q,- 7v8,). (8.22) 
mc 
Taking the complex reverse of this last equation gives us 
t ih T ayy tov 
w= 2Y Y! + 8,477"). (8.23) 


If one now substitutes the right-hand side of Eq. (8.22) into the first term 
on the right-hand side of Eq. (8.21) and one also substitutes the right-hand 
side of Eq. (8.23) into the second term on the right-hand side of Eq. (8.21), 
one obtains the relation 


ih 
y yy = — Wty", — yy WB, 


2mc 
= wry — 0 yn). 
Since y”y” = y™ + g'" T, this last equation can be rewritten in the form 
Vr = LU es + mo 
2mc 
+97 WY, — WW) 
— 9 (WB, + 2, V) 
— Ty WB, — 4, yy). (8.24) 


Since 7""., = 0, the first pair of terms on the right-hand side of Eq. (8.24) 
may be combined into a single term (yw)... In addition, if we project 
out the O-vector component of both sides of Eq. (8.24), the last pair of terms 
will cancel out. We then have 


J” = e(l) 
ieh 
- e (V), o 
mc 


Tus g" [Q^ Y, — W Wo — UWB, + 2,4 Vo]. (825) 


2mc 


Following the Gordon interpretation, the first term on the right-hand 
side of Eq. (8.25) can be interpreted as an internal current: 


ieh ieh 
Jh = 5 (QW), o = = (OW), o: (8.26) 
mc 2mc 
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From the form of Eq. (8.26), it is clear that Jir is covariant. The fact 
that Jj, also satisfies the continuity equation is not so obvious. To show 
that Jiu, satisfies the continuity equation, it is useful to again note that the 
operation of computing a covariant derivative of a Clifford number com- 
mutes with the operation of projecting out the 0-vector component of the 
same Clifford number. Therefore 


Jinr = M, (8.27) 
where 


ieh 
M" = LE (orit). (8.28) 
2mc 


To show that Jir satisfies the continuity equation, we must show that 
M™.,., = 0. Since M" = —M"", we know that M=3M""y,, is a 2-vector. 
From Ea. (7.17), 
0.4 — M”, Yn and 60. — M"... I. (8.29) 
However, from Eq. (7.20), we know that 66.77 = 0. Thus 
JNT; = M” v 0. (8.30) 


The second term on the right-hand side of Eq. (8.25) was interpreted by 
Gordon as a convection current Jony. Thus we write 


ieħ 
Jtony = T IEW Y; — v; oo — 202, oT. (8.31) 


This also satisfies the continuity equation since 


J” = Jint + Jéonv and Ju 


= Jr = 0. (8.32) 


The internal current Jir is considered analogous to the current due to 
bound charges mentioned at the end of Section 7.2. In the same spirit the 
components of M"" are interpreted as the electric and magnetic dipole 
moments of the electron due to inhomogeneities in the external electro- 
magnetic field or due to motion of the electron. For example 


P, = -M = 28 yt, (8.33) 
2mc 
and 


M, = -MU- Wo. (8.34) 
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It is worthy to note that Eq. (8.34) is identical to the equation 


M=—S (8.35) 
where 


(Sx, Sp Sz) =F Q7, y!*, y! ?y». (8.36) 


Equation (8.35) is considered to be valid for electrons except for small 
radiative corrections. 

With Eq. (8.20), we saw that it was possible to interpret the 1-vector 
component of ey. Equation (8.28) enables us to interpret the 2-vector 
component. According to the result of Problem 8.1, we know that 


(y so = on. 


Using this result along with Eq. (8.28), it is not too difficult to verify the fact 
that 


ey! = 1A + Shy, — = My 
+ a pseudo-vector and a pseudo-scalar. (8.37) 


From Eq. (8.37), we see that a knowledge of yyt immediately gives us 
the total current J* and the components of the electromagnetic dipole tensor 
M*. Since Jiyy = M*,, and Jéony = J" — Jir, it is clear that the internal 
and convection currents may also be extracted from a knowledge of the 
product ya^. This suggests to me the possibility that all measurable entities 
are encompassed in the product yyt. This in turn suggests the possibility of 
a c-unitary gauge transformation which is pursued in Problem 8.4. 


Problem 8.1. 


(1) Convince yourself that 


E (PP PPP Des so oo 


Jija. . z 
kik2.. 


(2) Use the result of part (1) to show that 


Cyt Pe Bs Bp? = piba.. bp (8.38) 
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(Hint: you may wish to use the relation 


grim tp = (we Wa... Wiz) piece 
-(Wnwm...Wiypnge...$m) 


Problem 8.2. Show that if fọ is represented by a Hermitian matrix and f4, 
$, along with ?4 are each represented by an anti-Hermitian matrix then 
$1 = fof *%o for j = 0, 1, 2, or 3 and $7 is the complex transpose of f. In 
addition, show that this result can be generalized to all Clifford numbers 


associated with Minkowski 4-space; that is 2t = fos * ĵo. 


Problem 8.3. Use the result of Problem 8.2 to show that if Y is c-unitary and 
M is a Clifford number associated with Minkowski 4-space then UPU * — $,. 
(Since in some presentations, 


1 0 0 0 
7 0 1 0 0 
Yo = 

0 0 -I1 0 

0 0 0 -1 


this last result implies that for Minkowski 4-space, the group of c-unitary 
Clifford numbers is isomorphic to the group U(2,2).) 


Problem 8.4. (A possible gauge transformation) If all physically measurable 
information stored in the wave function y is also stored in the product wy, 
then the wave function y must be considered equivalent to the wave function 
YPU where Y is any differentiable c-unitary Clifford number. What is the 
consequence of this supposition? 


(1) Suppose y’ = YU or y = V/A. Suppose y is a solution of Eq. (8.8). 
Show yw’ is a solution of the equation 


F : d , imc , 
y" V. zu y" Ba = USE. V 


where 
Bi, = UBU — (V, UU. (8.39) 
(2) Since V,I = 0, it is clear that 


(V,U)U = VLUU) — UVU = — rv). (8.40) 
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Use Eqs. (8.39) and (8.40) to show that 
VB, — BB, = A (V, 28, — BBU 
T (V,A47) 28,2) + (V, 2) B, U — (V, VUDU. 
(8.41) 
(3) From Fq. (8.41), it immediately follows that 
V,B, — V,28, + 28,08, — BB, 
= (V, 28, — V, 28, + BB, — 48,48,) 94 
— ((V, V, — V, VUNU. 
Use Eq. (5.70) to show that 


V, B, —V, B+ BB — BB, — IR 
= UV, B, — V,By + B,B, — By B, — FRU. (842) 


(Comment: If 2, = F, — (ie/hc) A,I, then 
V,B, — V, + B,B, — 2,20, — 10, = ES Fpl. (8.43) 
c 


Since Y commutes with I, it then follows that the left-hand side of Eq. (8.42) 
is also equal to — (ie/Ac) F,, I. 

Because of the nature of the last term on the right-hand side of 
Eq. (8.39), it is clear that there are solutions to Eq. (8.43) other than 
E, — (ie/hc) A,I. It is conceivable that some of these solutions could cor- 
respond to forces other than gravitational or electromagnetic. On the other 
hand it may be a situation which calls for a gauge fixing condition.) 


Problem 8.5. Use Eq. (8.43) and the reasoning used in Problem 5.14 to show 
that 


y'(V,28, — V, B, + BB, — B,B,) = (-in. E ihr 
c 


(Except for several differences in notation and sign conventions, this equation 
appeared in a paper by V. Fock in 1929.) 


Problem 8.6. Suppose we define the charge conjugate © of a Clifford 
number « by the equation 


AS = -JAJ (8.44) 
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where ./ is the complex conjugate of Z. Suppose y satisfies the equation 


imc 


y Val "S VB, = nó 5 y. 


Show that 


imc 


VW — yy Be = 5 yt. 


(Comment: if 
ais 
hc 
then 


At =T, +£ Al. 
he 


Thus V satisfies essentially the same equation as y but with the charge e 
replaced by —e. If y represents the wave function for an electron then y 
may be interpreted às the wave function for a positron. 

The charge conjugation operation defined in this problem is identical 
to that presented in more traditional presentations such as that which 
appears in Quantum Field Theory by Claude Itzykson and Jean-Bernard 
Zuber (1980). In the usual approach one takes the complex conjugate of a 
specific matrix representation of the Dirac matrices instead of the complex 
conjugate of the coefficients of the Dirac matrices. This has the effect of 
making the traditional definition unnecessarily complicated.) 


Problem 8.7. 


(1) Show (BY = A BE. 
(2) Show (AE) = wf. 


Problem 8.8. In Prob. 8.6, it was noted that if Z, was a linear combination 
of an imaginary 0-vector and a real 2-vector then the application of the 
charge conjugate operation results in a potential with the sign of the 
imaginary 0-vector component reversed and the sign of the real 2-vector 
component unchanged. Suppose &, is a generalized potential such that 
48] = —@,. In that circumstance what happens when the charge conjugate 
operation is applied to 2,? In particular, what happens to the sign of the 
imaginary 1-vector component; the real 3-vector component; and the 
imaginary 4-vector component of 22,? 


Problem 8.9. According to Eq. (7.58), the equation of motion for a point 
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charge is 


q 


Xs 2mc? 





(F (s)u(s) — u(s) F (s). 


Show that applying the charge conjugate operator to this equation is 
equivalent to changing the sign of q or changing the sign of s. 


82 Clifford Solutions for the Free Electron in Flat Space 


For the free electron in flat space, Dirac's equation can be written as 
—w = —meyp. (8.45) 


Since #*(G/dx") maps odd p-vectors onto even p-vectors and vice versa, 
it is obvious that y must be a Clifford function with a mixture of odd and 
even p-vectors. Suppose one considers a possible solution of the form: 


E i : 
w= (I + *e) ew(-; p»). (8.46) 
where 
E 
(Pos Di Pz P3) = (E, —Pe — Py» -.) 


and 


(x9, x3, x?, x) = (ct, x, y, Z). 
Tf we now substitute the right-hand side of Eq. (8.46) into Eq. (8.45), we have 
— (f*p,) + Fej) = —mc(1 + Fc). (8.47) 
Equating the 1-vector components on the two sides of Eq. (8.47), one gets 
p, = mepie, 


and thus 


J 


c =P for j = 0, 1, 2, and 3. 
mc 


Plugging this result back into Eq. (8.47) gives us 


GPG p) =m eI 
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or alternatively 
(9. Gp?) -mcl 
or 


E? 
= (p)? = mre? 


where 
3 
(p? = Y, (PP = BB = (py)? + GD + GJ. 


This implies that 
E = + me + (pfc?. (8.48) 


Letting E designate the positive square root, we get a solution in the 


form: 
p PE 
yo = | + «(2 api pa). (8.49) 
mc h 


If we replace E by — E and p by —p, we get a second solution: 


p AN 
y= | — «(E ew p»). (8.50) 
mc h 


Generally a wave function is considered to represent a particle with 
energy E if E is an eigenvalue of the operator ( —71/1)(0/0t), that is if 


h ô 
—— y = Ey. 
jut oh 


According to this assumption, a wave function of the form that appears in 
Eq. (8.50) would have to be interpreted as representing a particle with 
negative energy. One could choose to regard such wave functions as 
extraneous solutions. Alternatively one could choose to regard these solu- 
tions as ammunition for the argument that Dirac's equation represents a 
certain amount of nonsense. 

Dirac chose neither of these options. When he presented his equation 
in 1928 he advanced the theory that these negative energy solutions implied 
the existence of antiparticles. Such prominent men as Niels Bohr and 
Wolfgang Pauli ridiculed him for such wild speculation. However, Dirac had 
to put up with this criticism only for a few years. In 1932, Carl D. Anderson 
discovered the positron and Dirac was awarded the Nobel Prize the 
following year. 
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It 1s useful to observe that 
p* 
(E) = BiB", (8.51) 
mc 
where B is a boost operator. In particular 
= $ a 
48 — exp z Ji}, (8.52) 


à = faau? + 31u? + Pou, 


Jů = y1ou! + yao? + yaoi, 


cosh $ = u? = MEM TE 
J/1 — (v/c)* 
dde ue e 
1 — (v/c)? 
Using Eqs. (8.49)-(8.51), we get 
y? = BU +) e= rst) (8.53) 
and 
VO = BU —9,)Bo? ew ns). (8.54) 


Because of the form of Dirac's equation in flat space, it is possible to 
multiply any solution on the right-hand side by a constant Clifford number 
to obtain another solution. In this fashion, the most general solutions can 
be written in the form 


MEE "e past aa T 9€ (8.55) 
and 


yo = ew no ara - $94, (8.56) 


where @ is an arbitrary constant Clifford number. 
If we now insist that y be an eigenfunction of the spin operator (ih/2)5,; 


DIRAC'S EQUATION FOR THE ELECTRON 183 


in the rest frame and @(ih/2)},,B ! in the laboratory frame, it should be 
noted that 


z A h T 
< faQ E ifi) = +50 tdi). 


Incorporating the spin terms into the solutions gives us 


yD = exp( - i nat) + $90 + i912); (8.57) 
ytd = exp( =i pax) a + $90 —i$,)«; (8.58) 
yO = aw pax" ara — $9 + 1$44)€; (8.59) 
y= ew; pax" aa — $9 — i942). (8.60) 


One advantage to this formulation is that it is easy to represent a state 
with arbitrary spin direction. If (st, s?, s?) represents a vector of unit length 
then we can represent the state of a free electron with spin in that direction 
as follows: 


y? = ew n ara + $9 + i$)€ (8.61) 


where 
$ = $535! + P3187 + P128°. (8.62) 


It should be understood that $ represents the spin direction in the rest 
frame of the particle. (See Problem 8.14.) 

It is useful to note that ¿(I + $9) and 4(7 + i$) are commuting projection 
operators; that is 


30 + $930 + $9 = 3U + fo); (8.63) 
3 + $930 — fo) = $0 — $9310 + Fo) = 0; (8.64) 
WI + i 4 + is) = 10 i$); (8.65) 


and if P = 1I + 29)1 + i$) then (P)? = P. 
To compute the 4-current, we note that from Eq. (8.61), we have, except 
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for an appropriate normalizing constant: 

ey CO C7 = eBIl+ $9) -- i$) (1--iS(1-- $9)28 ^. — (8.66) 
It will be shown in the next section that there is no loss of generality if @ 
is required to be a linear combination of real even order p-vectors. From 


Eq. (A1.8), we know that for such a Clifford number, 66t = aI + bJ where 
a and b are real numbers. Also J anticommutes with fọ so 


(+ Po) IU + Fo) = JU — Po) + Fo) = 0. 
Using this result along with Eqs. (8.63) and (8.65), Eq. (8.66) becomes 
ey YT = 4aed8(I + $9) + iB. (8.67) 
The 1-vector component of this is 
JO = J*$, = 4aeB foB = 4ae(},u*), (8.68) 


where the u*’s are the components of the world velocity. 
For positrons, a similar calculation gives 


ey yt = 4aeBU — SAU + i287! (8.69) 

and 
JO) = J*j, = —4aed8)928 ! = — aef,u*. (8.70) 
Problem 8.10. Show that for the free electron, the phase velocity vpyasp = 
E/p. Show that the velocity of the 4-current or the velocity associated with 


the boost operator is pc?/ E. (Note: vpyase¥curRENT = C^ UcuRRENT < C, and 
UppasE > C.) 


Problem 8.11. Show that 
exp( —iw) + 912) = exp( t ?12w)(l + $5). 


(This means that the term *exp[ —(i/A)p,x*]" that appears in the free 
electron solution may be interpreted as a time-dependent rotation operator 
about the spin axis. I have no idea whether or not this has any significance.) 


Problem 8.12. Use Eqs. (8.67) and (8.37) to determine both the electric and 
magnetic dipoles of the free electron. Observe that the electric dipole is 
induced by the motion and is perpendicular to both the direction of the 
magnetic dipole and the direction of motion. 
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Problem 8.13. Apply the charge conjugate operation defined in Problem 8.6 
to the free electron solutions of Eqs. (8.57)-(8.60) and thereby demonstrate 
that 


(y C DC EN yo and (Wt DYE = yf D, 


Problem 8.14. Suppose the y^ is the solution for the free electron written 
in the form of Eq. (8.61). Show that 28(ih/2)522- tw = (h/2)y C. 


8.3 A Canonical Form for Solutions to Dirac's Equation in Flat Space 


The free particle solutions to Dirac's equation suggest that one can find 
solutions of the form: 


y — DU xij) t 9$. (8.71) 


This is indeed the case. In itself this does not say much. If ® is a solution, 
then we can multiply it on the right-hand side by any constant Clifford 
number and the resulting product must also be a solution of Dirac's equation. 
Since (I + if,.)7 + fo)@ is such a constant Clifford number, it is clear that 
the product ®U + if,.)U + $9)€ is a solution if 6 is a solution. Furthermore 
an arbitrary solution can be written as a linear combination of such forms 
since 


DE = 20 i$) + 9€ + 198 — 10,0 + 29€ 
+ 49 + ifi — 49€ + 20 — ipa) — Fo). 


What makes the solutions of the forms that appear in Eq. (8.71) useful 
is the fact that the Clifford number ® can be cast in a form that lends itself 
to physical interpretation. First of all, ® can be written as a linear 
combination of real p-vectors of even order. To prove this, we will consider 
the case for which 


V = e + ifi) + Jo) €. 


(It should become clear to the reader that the argument used for this case 
can be generalized to all four cases that appear in Eq. (8.71).) 

Any complex Clifford number can be written as a linear combina- 
tion of a real Clifford number and an imaginary Clifford number. Thus 
® = p, + i, where both ®, and ®, are real. However, (I + if,.) = 
191; + if,,). Therefore 


OT + 19,2) + Fo) = (6, + 19) + 19,0 + PoE 
= (®, — 6,5,,) + i912) + fo)@. (8.72) 
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Thus if ® is complex, it can be replaced by the real Clifford number 
®, — 0,5. 

As stated above it is also possible to require that ® be a linear 
combination of even order p-vectors. To show this, we note that an arbitrary 
Clifford number can be written as a sum of two Clifford numbers. One 
consists of a linear combination of even order p-vectors and the other consists 
of a linear combination of odd order p-vectors. That is 


® = du + Dopp. 


On the other hand (I + ĵo) = foll + $9). Thus 


(I + if 12) + 79) = (even + Popp UT + 19,,) + fo) E 
= (Deven + Dopp fo) + 191:)0 + $9). 


Therefore if ® has some odd order components, ® can be replaced by 
Piven + Donno which consists of even order p-vectors only. 

Clearly the same comments made above on the Clifford function ® also 
apply to the Clifford number @. That is @ may be restricted to be a linear 
combination of real even order p-vectors without losing any generality. In 
the last section, it was pointed out that with this restriction, the choice of 
€ has no impact on the product wy" except for a possible normalizing 
constant. On the other hand, the arbitrariness of @ is critical in scattering 
theory where one must compute products of the type yy} where y, and 
V; represent Clifford wave functions for different particles. 

The fact that ® may be restricted to a linear combination of real even 
order p-vectors enables us to extract a physical interpretation from it. From 
Theorem A1.1 and Theorem A1.2 of Section A.1 of the Appendix, we know 
that if bo! + 0, then 


o = NBR FY, (8.73) 


where N is a normalizing constant, 27 is a boost operator, 2 is a rotation 
operator, and Y is a duality rotation which has the form 


J= exp($ j (8.74) 


One may ask what is the significance of the duality rotation operator. 
When one computes yyt, one has 


wt = » BRIU LIN xit an. (8.75) 


Since J commutes with §, the duality rotation operator has a direct effect 
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only on (J + ĵo). We note that 
Jh = exp( 2 7j o(s 1) 


= ew( 7) op? Ii = fo. 


IU + oA = exp(oJ) + fo 
= (I cosw + J sin w + fo). (8.76) 


N 


N 


Therefore 


Plugging this result back into Eq. (8.75), we have 


N2? 





yy = T BRI cos w + o + i$ cos w + iJ$ sin w + ijo$ +J sin w) RTB. 
(8.77) 
From Eq. (8.37), the electromagnetic dipole 2-vector is 
iea eN?h "m Nm 
M — 3M = — En 2899 ($ cos o + JS sin w)R ^28 !. (8.78) 
This suggests that 
N?h 





($ cos œw + J$sin o) 
8mc 


may be interpreted as the electromagnetic dipole 2-vector in the rest frame 
of the electron. In particular if $ = ,,, then in the frame of the electron, 


N?h 
M,--M" E coso (8.79) 
mc 
and 
N? 
P. = —M?° = ie sin o. (8.80) 
mc 


For the free electron, œ = 0. However, for the electron associated with 
a hydrogen atom, this is not the case. Presumably the electric dipole is 
induced by the inhomogeneous electric field due to the nucleus. It should be 
pointed out that both N? and œ have a spatial dependence. Thus M* has 
a spatial dependence. 
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Problem 8.15. Suppose M** is the electromagnetic dipole tensor, that is 


0 PB P B 

-B 0 -M, M, 
[M^] = 

-P M, 0 -M 


-P -M, M, 0 


Suppose further that P, — P, — P, — 0. Apply an arbitrary boost operator to 
the 2-vector M = 3M*y;, and show that in the “boosted” frame the electric 
dipole is usually not zero but it is always perpendicular to the magnetic 
dipole. 


Problem 8.16. In the usual approach to Dirac's equation, one uses a specific 
representation for the Dirac matrices and then seeks out vector solutions. 
These vector solutions are then used to compute expectation values. This 
problem is intended to provide the reader with a comparison of this approach 
and the approach presented in this text. 

In virtually all representations used in actual computations 7° is 
Hermitian and ® is anti-Hermitian for j = 1, 2, and 3. 


(1) For such a representation show that if y is an arbitrary Clifford 
number then 


yp = Py (8.81) 


where w* is the Hermitian conjugate of y. 
From Eq. (8.81), it follows that 


Py dy = yt. (8.82) 
If y? = DU + i; + fo), then 299071 = yO and 
YS ogy) = WOT gy, (8.83) 
If y^? is a negative energy solution—that is if 
UW = O( + ifi) — fo), then Pyt = -y 


and 
y C ayy S — C gy, (8.84) 


When dealing with a specific representation of Dirac matrices, 
any column of a solution matrix y may be considered to be a column 
vector solution. In the usual formulation, one computes expectations 
in terms of these column vector solutions. For example, one obtains 
the components of a “probability current" J* from the defini- 
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tion: J* = (y *j9?j) where y is a column vector solution. More 
generally, the expectation of a matrix operator £ is computed by 
computing <> = (P* AY). 

(2) Using the representation of Eq. (2.8), compute each of the four 
matrices: 


à + ipsa) + Po). 


(3) Use the result of part (2) to show that if y is in one of the four 
canonical forms (I + if,,)U + fo), then for the matrix representa- 
tion of Eq. (2.8), w has only one nonzero column. 

(4) Show that if y is in one of the four canonical forms and y is its 
nonzero column then 


GPA) = PA = 4S Ah). 
= Aip = +4 Ay = x4, Ap). 


The + sign occurs for the positive energy solutions and the — 
sign occurs for the negative energy solutions. Thus we see that the two 
formalisms give essentially the same expectation values. The factor of 4 
disappears when one assigns different normalization conventions to the two 
formalisms. In the standard formalism, (y *j9j^y) is interpreted as a 
probability current and the sign is adjusted “by hand" when the probability 
current is multiplied by +e to give the charge current. In the formalism of 
this text, the computed current is interpreted as a charge current and the 
sign difference between particles and antiparticles arises from the formalism 
automatically. 


8.4 Spherical Harmonic Clifford Functions 


Before solving Dirac's equation for the hydrogen atom in the next section, 

we will examine the Clifford analogue of the spherical harmonic functions 

for dealing with Dirac's equation when the potential is spherically symmetric. 
If it is understood that 


ho 
—-— y = Ey, 
ia y 


then the time-independent version of Dirac's equation for an electron in a 
Coulomb potential is 


(2 _ Ze ) +È gar E T (8.85) 


=1 1 





This equation can be rewritten as an eigenvalue equation by multiplying 
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both sides by cj? and regrouping the terms. The result is 


Hy = Ey, 


where 


It can be shown that 


JH = HJ, for k = 1,2, and 3, 
where 


J, = Li+ S, 


T2, a 
LL. 2 ers) 
aa (x xe T a] 





ih 
S, = 3 723» 

ih , 
S; 7536 

Ih, 
Da 5 Tid 


From Eq. (8.88), it follows that if Hy = Ey, then 


AW) = 5H = BEY = Ey). 


(8.86) 


(8.87) 


(8.88) 


(8.89) 


(8.90) 


(8.91) 


(8.92) 


(8.93) 


(8.94) 


(8.95) 


This means that if y is a solution of the time-independent Dirac equation 
with energy eigenvalue E, then J4 is also a solution with the same 
eigenvalue. This in turn implies that any solution to the time-independent 
Dirac equation can be decomposed into a linear combination of Clifford 


functions each of which is an eigenfunction of J. 


Since neither J, nor J, commutes with J}, it is not possible to find Clifford 
functions which are simultaneously eigenfunctions of H, J}, J, and J. 
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However, J; does commute with |J|? where 
|J? = (A? + (4)? +BY. (8.96) 


Thus if H corresponds to a Coulomb potential or any spherically symmetric 
potential, it is possible to find Clifford functions which are simultaneously 
eigenfunctions of H, J;, and |J|?. For this reason, it is useful to determine 
those Clifford functions which are simultaneously eigenfunctions of J; and 
|J|?. To pursue this goal, one should note that the eigenfunctions of L, and 
Lj? are well known as spherical harmonics (Jackson 1962, pp. 54-69; or 
Abramowitz and Stegun 1965, pp. 332—341). These functions are defined by 


Lia)! 
4n (1+ A)! 





Y, (0, $) = Pi(cos 0)e^*, (8.97) 


where P? is an associated Legendre function which can be defined by the 
equation: 


(-1y 
2! 





L+A 
Pi(u) = (1 — uy? Im (u? — 1y. (8.98) 


(Here | is a nonnegative integer and 4 is any integer between —/ and +1.) 
These functions satisfy the relation 


L; n= hAY, ; (8.99) 
and also 
IL Y, - R1) Y, i. (8.100) 
We note that 
2 3 ape : 7 3h? 
|J|^ = ILI + th(P23L, + 3122 + Pi2bs) + 4 I. (8.101) 


Each of the operators L,, L,, and L4, map linear combinations of 
spherical harmonics for some fixed value of l into linear combinations of 
spherical harmonics of the same type. Because of this, Clifford eigenfunctions 
of |J|? and J; may be written in the form: Y = Y, ;.«7^, where the ./*’s are 
constant Clifford numbers—at least not functions of 0 or $. 

Since Zt = 1(I + ij, ,).o7^ + HI — i$, 2)’, we see that 


ih, A ih. " 
LY = (z + 2h) Y, 430 + ifi) At + (z + ie) Y, 430 — ifa) At. 
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Since 19,5 + 19,5) = X +1),,), this last equation becomes 
BY = h(4 + 3) Y, 3 + 1913) ^ tA — 3 Y, 3 = ifi) 2’. 
If RY = mhY, then 
Y = Yu aD + iP) + Yims y2(l — i912). 


It will be useful to replace /’ by 74,48. If we pause to note that 
CU — 1912)931 = $31 + if,2), our equation for Y becomes 


Y = Y, 430 + ifi) A + Yim+sh312 + if 12)B. (8.102) 


Now we need to adjust the Clifford numbers S and & so that & is also an 
eigenfunction of |J|?. We note that without this adjustment &/ is already 
an eigenfunction of [L|?. From Eq. (8.101), we see that for Y to be an 
eigenfunction of |J|?, it must also be an eigenfunction of 


Pohk = 934L, + ifla + ifi aL3 + Al. (8.103) 
This may be rewritten in the form: 


$ohk = 34,10 + ifi) Ls, — $4410 — i9,2.)L_ + i9,.L, + hl, (8.104) 
where 
L, = L, +iLl,, (8.105) 
and 


E LÀ, (8.106) 


(Labeling our operator by ?)k may seem strange, but the operator k will 
prove to be helpful in its own right.) 

The operators L, and L.. are known respectively as step up and step 
down operators. It can be shown with some difficulty that 





LO 5SRhJ0—20342* DX (8.107) 


and 





L_¥,, h/(l - 200 A Y, (8.108) 
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From Eqs. (8.102), (8.103), (8.107), and (8.108), we have 


fohkf = A / (0-3 — m? Y, 430 if.) 
+ h(m — Y, 4,30 ifi. 
BAKE S — m Y, mafa + if 12) A 
— h(m + 3) Y, 4,423130  1$1,)28 + hij 
= hk. 


Equating coefficients of Y, ,,.., 3(1 + if,5) and then coefficients of 


Y, m+493140 + ifi) 


we have 


J/ 4-3)? — m? B+ (m — 4). + of = kod 


and 
U+ — m? s£ — (m 4- 3)28 4- 28 — kB. 


Written in matrix form, these two equations become 


| vie be 
JUD m -(k-iem LA) iol 


The necessary condition for a solution of this matrix equation to exist is 
that the determinant must be zero. This means that (k — 4)? = (I + 4)?. From 
this, we get two cases: 
Case 1:k 2 14 1, (8.109) 
Case 2: k = —I. (8.110) 


For case 1, 


a- YOD w p Vision, 


l+3+m Vim 





We then have 


l| - 39m l| - $—m » » 
Yn = ( uc Ym- + a a Y, m+4931 ]20 + i942) 


where @ is a Clifford number which is a function of r only. From Eqs. (8.101) 
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and (8.103), we have 

|J]? = ILZ + $9? k — V? I. (8.111) 
Thus 

IPUR = PIG + 1) - (04-1) - 119/92 
-4 HDE + DY. 

If we insist that 

IPY, m = FIG + DY, m 
then 

j=l+ż 


and we have 


jtm lj —m : de 
qo = (po Yi, m-& + U*»P Yamata HC + if 12)6 


where k =j +4=1,2,3,...andm= —j, —j+1,..., +). 
A similar calculation for case 2 results in the identity: 


z j+l—m fj+1l+m l 
Ys z = du Y, mop [Hu Yat lI +i € 
: (s 1) ere 2 + 1) 1+ 4.m+4P31) 26 512) 


where k = —(j + 3) = —1, —2, —3,... and m = —j, —j + 1,..., +j. 
In either case 


IY) = mY, (8.112) 
and 
[JPY = j(j + ays. (8.113) 
Furthermore 
PRYS = G+ 2915, (8.114) 
and 
PRY = —(j 4 DM. (8.115) 


It is useful to require that these spherical harmonic Clifford functions 
also be eigenfunctions of fọ. To achieve this, we merely insert the term 
1 + fo). That is 


j-my, n jj—m 
2-i( 47 2j Y, -4,m-4 t 731 I j Yrs) + 1912) + $9*€ 


(8.116) 


i 
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and 





o j+l+m ) 2 . 
Ysni JU +1 I $9) 8.117 
Yai 2d * D Jtimc4 ( 513X Po) ( ) 


With this modification. Eqs. (8.112)-(8.115) remain valid. But we now have 
two more: namely: 


Kt = (G+ DY» (8.118) 
and 


kB) = —G+ DY. (8.119) 


These spherical harmonic Clifford functions are now eigenfunctions of 
$9 with the eigenvalue +1. One would think that it is also necessary to 
construct spherical harmonic functions with eigenvalue — 1 for ?y. One way 
to do this is to insert either the 1-vector 7, or the 2-vector 7,. in front of 
UY and YC as defined by Eqs. (8.116) and (8.117). However, in either 
case the inserted factor can be absorbed into the nonangular component of 
the Clifford wave functions that are computed for Dirac's equation. Thus no 
generality is lost if one restricts consideration to the spherical harmonic 
Clifford functions defined by Eqs. (8.116) and (117). 

It is interesting and perhaps useful to reorganize these spherical har- 
monic Clifford functions as rotation operators acting on the projection 
operator ¿(I + i?,5)1 + $9). From Eqs. (8.97), (8.116), and (8.117), we have 


gi gin - Dó (ERE 
x 4 4n(j +m)! 


x [G + m) PP 2 (cos 0) + PY (cos 0)e^94,]0 1913 + 29€ 








J-3 
(8.120) 
and 
aer gin - à Him. 
ne 4 4n(j +m)! 
x [(j — m + DP c$(cos 0) — P? 2 (cos 0)e99,,] 
x (I 4-109153) + $9. (8.121) 


It is possible to replace e/^?,, in both Eq. (8.120) and (8.121) by a real 
Clifford number. To accomplish this we note that if 30 + if15) = (1 + 1f,,). 
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Therefore: 
e95 1 + i912) = (93, cos à + if; sin 9) + ifia) 
= (931 cos $ — (if31)(if12) sin PU + i942) 
= (73, cos  — P23 sin 9) + 19,2). 
Interestingly enough, it turns out that if one uses spherical coordinates, 


A 


9,, cos $ — f, sin o = ^9, (8.122) 
r 


The 2-vector $,)/r is normalized so that (?,,/r)? = —I. 
Equations (8.120) and (8.121) now become 


aft) = gim-ne | U — mt 
did 4n(j 4- m)! 


«|o + m) P7 (cos 0) + 2 P™*s(cos o fia tia + 90) 
(8.123) 


and 


a) = gim-ne | UO — mt 
"s 4n(j +m)! 
«Jo — m + 1) P7: (cos 0) — T" Pr dcos » lia + ifi 39€ 
(8.124) 


It is now possible to interpret these spherical harmonic Clifford functions as 
the product of a scalar function, a rotation operator, and the projection 
operator 4(J + if12)(I + ĵo). Suppose we define 


m 4 B - G AT m)! 
(N7(0))* cos 2 ax Em (j + m) Pjg (cos 0) (8.125) 


BO (j — m)! pms 
2 4n(j +m)! PT 





and 


(N7'(0))* sin *(cos 0) (8.126) 





Then Eq. (8.123) becomes 


qu = ein orgy cos É 4 7? sin 4 id + 19,2) + $9«€. (8127) 
r 
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The angle f depends not only on 0 but also on the indices j and m. Thus 
it is perhaps true that we should attach the indices j and m to the angle f. 
However, doing so becomes somewhat cumbersome. Furthermore the angle 
p will not appear in any expression without the indices j and m appearing 
elsewhere in that same expression. Thus the addition of the indices to the 
angle f seems unnecessary. 

If we square both sides of Eqs. (8.125) and (8.126), we get 


wo- iUm 
4n (j +m)! 





[G + my(P7-3Y? + (po*3y^]. (8.128) 
The N"(8) functions have been normalized so that 
n [2n ps 
| | N7(8) do sin 0 dé = s N'7(0)sin8d0 — 1. (8.129) 
0 Jo 0 
It is somewhat surprising that 4/1) can also be expressed in terms of 


the same angle f. Accordiing to Magnus, Oberhettinger, and Soni (1966, 
p. 171), we have: 


(p — q + 1)P$.,(cos 0) = (p + q + 1) cos OP4(cos 0) + sin 0P2* ! (cos 0) 
(8.130) 


and 


—P4,,(cos 0) = (p + q) sin P2 ‘(cos 0) — cos OP4(cos 0). (8.131) 
Equation (8.130) implies that 


(j—m + IP = (j + m) cos 0PT-? + sin OPM? (8.132) 


or 


G- m)! 


mG + m)!  —m- DP 


= (N> (cos 0 cos £ + sin 0 sin 2) = (N™)* cos( o — Al (8.133) 


In a similar fashion, Eq. (8.131) implies that 


se D PE 
JE e emt = em sin(o Al (8.134) 
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With these relations, Eq. (8.124) becomes 


quU) eem DAN) 
x | cos( 5 j "nm n(o- al + i9,,)L +4). (8135) 
r 


From the computations of Section 8.1 it appears that most if not all 
expectation values can be extracted from the product yyt. With this 
motivation, let us compute the product &/(7)49/(*)*. To do this, we will assume 
that the constant Clifford number @ that appears in Eq. (8.127) is c-unitary. 
We will also use the fact that the square of the projection operator 
iU + ifi) + 99) is equal to itself. We then have 


Yay = w(i cos B + fro sin j 
! s 2 r 2 


x (i$ + i cos à — * sin B), (8.136) 
r 


The operator ĵo commutes with the rotation operator and 


(ros rin reos Manh) = 
2 r 2 2 r 2 


From Eq. (8.122), $,, anticommutes with 7,,. Thus 


(: cos + "sin? p ^ Jaso(1 sos ss ire sin 5) 
2 2. F 2 





= a(i cos p zit sin Al = = inj cos fj — ^? sin i sin), 
2 r 2 r 
Thus Eq. (8.136) becomes 
YOY = NBL + i1 + ind cos 8 — sin h^ sin p) | (8.137) 
It can also be shown that 
A Pow Por 


aedem cos 0 sin 0. 8.138 
^12 7 2 in 8 r sin 0 ( ) 





If one substitutes the right-hand side of Eq. (8.138) into Eq. (8.137) and then 
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carries out the required multiplications, one finally obtains 


Miu = ivo 1e 2 d m Wis QUO BER a gie- o Jur + Yo). 


(8.139) 


To obtain the product YCR it is only necessary to replace the angle 
p/2 that appears in Eq. (8.127) by the angle (0 — ß/2) that appears in Eq. 
(8.135). This means that instead of getting $ — 0 or 2(f//2) — 0 in Eq. (8.139), 
one gets 2(0 — B/2) — 0 or 0 — B. Since cos(0 — B) = cos(f — 0) and 
sin(@ — B) = —sin(f — 0), we get: 





YAW = ANT) | + 1% cos(p— 0) — —*— sin(p — oa + fo): 
: , r^ sin 0 r sin 0 
(8.140) 
Because of the critical role that these products of spherical harmonic 
Clifford functions play in the computation of currents and dipoles, it is useful 
to introduce an explicit notation for the functions that appear in Eqs. (8.139) 


and (8.140). In particular, let us define: 


C7(X) = N70) cos(B — 0) (8.141) 
and 


S7(0) = N70) sin(f — 0). (8.142) 
With this notation, we have 
Yay) = [Nro + 7 C"'(0) + des sola + fo) (8.143) 
Dai 4 7 r sinb ” rsinü ! 


and 








= = 1 m m r m, 
ama = 1 IN] (0) 4- C] (0) — e 557 rofa +>) (8.144) 


To obtain explicit formulas for C7(0) and S7(0) in terms of associated 
Legendre functions, one should note that 


cos(f — 0) = ex — (0 — 4) 
= cos B cos( 0 — ^ + sin B sin( 0 — P. 
2 2 2 2 
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Using this relation along with Eqs. (8.125), (8.126), (8.133), and (8.134), one 
gets 


C7(0) = N78) cos(B — 8) 


= es Ure [G+ m- m + DPE PYÈ 


An (j + m)! — PREP RTA. (8.145) 


In a similar fashion, one finds that 
S7(0) = N78) sin(B — 0) 


_1G-m)! 


4n (j + m)! [(j —- m + ])PT*à Pree ++ m) P PRT. (8.146) 





From Eggs. (8.128), (8.145), and (8.146), it is possible to show that 


N;"(0) = N™(6), (8.147) 
C;"(0)- —C"(0). (8.148) 
$;"(0)  —S"(0). (8.149) 


(See Problem 8.23.) 

From Eqs. (8.98), (8.145), and (8.146), it is also possible to show 
that the C7(0)'s are polynomials in odd powers of cos 0 and the S7(0)'s 
are polynomials in odd powers of sin 0. For low values of j and m, we 
have 


A 1 
Cł(0) = "m cos 0 
E l 
S$(8) = Ee sin 0 
Ni(0) = tn 
1 
C350) = = sin? 0 cos 0 


53330) = Ae sin? 0 
8x 


1 
N32(0) = — = sin? 0 
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1/2 1 in2 
C3/3(0) = — (—9 sin? 0 + 4) cos 0 
81 
1/2 1 1202 s 
S12(0) = — (9 sin? 0 — 8) sin 0 
8x 


1 ; 
NYX0) = x6 sin? 0 + 4) 


15 
C2/3(0) = —- sin* 0 cos 0 
2n 
15 
S2/2(0) = ——— sin? 0 
5/2(0) cm 


15 
N32/3(0) = 32x sin* 0 


C3/3(8) = ERG 25 sin* 0 + 16 sin? 0) cos 0 
S3/2(0) = — 5,05 sin* 0 — 24 sin? 0) sin 0 


N3A(0) = 2 (—15 sint 0 + 16 sin? 0) 


C320) = = (25 sin* 0 — 24 sin? 0 + 4) cos 0 
D 
Sy2(0) = uc 25 sin* 0 + 36 sin? 0 — 12) sin 0 
1/2 3 int e) 
N1320) = — (5 sin* 0 — 8 sin? 0 + 4) 
16n 
Problem 8.17. Show that 


a a 
DE alari- 2). 


(It is understood here that the square brackets designate the 
commutator brackets. That is [..7, 22] = JB — BA.) 


S ð 
(2) Show that |^ 2 4 zi = 
k=1 Ôx 
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(3) Show that [J3, f(r)) = 0. 
(Comment: parts (2) and (3) serve to verify Eq. (8.88).) 


Problem 8.18. Show that the operator k commutes with $,, r?, ĵo, |J|?, J,, 
J, and J. 


Problem 8.19. Show that 
ky = l J]? +H 
(ky = zzl |^ al. 


Problem 8.20. Use Eqs. (8.97) and (8.98) to show that 





Ls Ypa =A p- aptat Y, 


(Suggestion: first show that 
L, = vei 2) 
00 | sind dd 
Also show that from Eq. (8.98) 


1 1 @ \Pt4 
P4(cos 0) = —— sin? em 2) cos?? f. 
2?p! sin 0 00 





If you wish to show that L- Y, , = h/(p + ap — 4+) Y, 4-1, it is prob- 
ably useful to use the relation that 


(p — a)! pa 


P>4=(—1) 
pus APTF * 





Problem 8.21. 


(1) Show that if we apply the charge conjugate operator defined in Prob. 
8.6 to the operator k of Eq. (8.104), one gets k€ = — Kk. Use this result 
to show that 


KYE = FG 30$. 
(2) Show that (YENE = —mh(4/()* and 


IPO we = WIG + DOWD. 
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Problem 8.22. Use the relation that 
i . 
| P4x)PXx)dx=0 ifpsr 
-— 
and Eq. (8.145) to show that 


| C70) sin 0 dé = 0. 
0 
Problem 8.23. Use the relation that 


(P — 4)! p, 
(pa! ° 





P74=(-1)! 
and Eqs. (8.128), (8.145), and (8.146) to show that N; "(0) = N7(0), C; "(0) = 
— C7(0), and S7 "(0) = —$"(0). 

Problem 8.24. Define the 2-vector 
1 x? x? 


ifo n^ +A EV 
zaa Yz ta Ida 
r^ sinf 0 r r r 





Show 


Pog z 
WUY) = yo 
(= sin? ; s xc 


(Comment: this shows that Y} and 4/1 have opposite parity.) 


Problem 8.25. Under the symmetry operator of parity P; Px* = —x* for 
k = 1, 2, and 3; and Pf(6, $) = f(x — 0, $ + n). 


(1) Use Eqs. (8.97) and (8.98) to show 
PY, (0, $) = (— 1)" Y, (0, $). 
(2) Use the result of part (1) to show 
PY = (71) WG = (C7 1997 
and 


PY = (71392 = (— D'93 
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85 Clifford Solutions of Dirac's Equation for Hydrogen-like Atoms 


Having computed the Clifford eigenfunctions of the operator k, we now turn 
to the problem of finding the Clifford function solutions to Dirac's equation 
for hydrogen-like atoms. 

Using Eq. (8.103) and the relation 





3 
x 
F= oy $*—, 
k=1 r 
it is not difficult to show that 
3 1 a ee 

= 7" r rok. 8.150 
PE j ;-f( or ) r i ( ) 


From Eqs. (8.150) and (8.85), Dirac’s equation for an electron in a 
Coulomb potential with energy E becomes 


2 . 
|^ (s F ze) — mel + incr (* Sr) E “ ru —0. (8151) 


To solve Eq. (8.151), it might be thought that we should consider Clifford 
functions of the form: 


y = [AQ) + ??B(r) + if CE) + ij" DTM. (8.152) 
However, since 299/02 = Y+), Eq. (8.152) may be rewritten as 
y = [I(A() + BG) + iC) + DO)IM R. 


This form can obviously be simplified. If we replace (A(r) + B(r)) by F(r)/r 
and (C(r) + D(r)) by G(r)/r, we have 


y= | nD ir 2 less. (8.153) 
r 


If one now substitutes the form for y that appears in Eq. (8.153) into 
Eq. (8.151), and absorbs the matrix f? into &*) wherever it occurs, we have 


2 
|(s NZ mc?) FO) | he ( an j á e ane 
4nr r r \Or r , 
2 
-ir| (c gp me?) E (5- 2 4 ro ur 
4nr r r \er r 








ge 
ll 
o 
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Therefore 
2 F k 
(x +2 — me) oe n Jen =o, (8.154) 
4nr r r\or r 
and 
2 ð k 
(« 2€, m) GO) *( )ro - 0. (8.155) 
4nr r rðr r 


These same two equations appear in the usual spinor formalism used to 
deal with Dirac’s equation for hydrogen-like atoms. To solve them, it is useful 
to abbreviate the constants that appear in the two equations. In particular let: 








e 
= 5 8.156 
2 4nhc ( ) 
m? +E 
j= (8.157) 
he 
2 — 
E uci (8.158) 
hc 
(The constant « is of course the fine structure constant.) 
With these abbreviations, Eqs. (8.154) and (8.155) become 
k 
Set Gare [e p (8.159) 
dr r r 
and 
dF k 
=P @jGs= 6 (8.160) 
dr r r 


If we consider asymptotic solutions of Eqs. (8.159) and (8.160), we have 


dG 
— — (x. )F—-0 and OF =e Ga. 
dr dr 


The most general solution to these two equations is 


F ~ a(a,)? e^ + b(x,)* e ^ 
and 


G ~ a(a_)? e” — b(x )* e^ 
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where 


À = (x,a y. 


If we insist that our solutions remain bounded for large values of r then 
a = 0 and we have 


F ~ b(a, e^? (8.161) 
and 
G ~ —b(x .)*e ^. (8.162) 


On the other hand, suppose we consider solutions of Eqs. (8.159) and 
(8.160) near the origin. Let 


F-2r Y fr an G=r ¥ gr. 


j=0 J=0 


If we plug these infinite series into Eqs. (8.159) and (8.160) and then equate 
the coefficients of r*^!, we get 


(s + gg = —Zafo and (s — k) fp = Zago. 
Therefore (s? — k?) fogo = — Z?a? fogo or 
s= +./k? — Za. 


If we require that our solutions be reasonably well behaved near the origin, 
then we must insist that s be positive, that is 


s= +./k? — Zh. (8.163) 


To obtain exact solutions of Eqs. (8.159) and (8.160) in terms of functions 
which have been studied in detail, one must carry out some manipulations. 
To do this, we first multiply Eq. (8.159) by («,)? and Eq. (8.160) by (x .)*. 
We then add and subtract the two resulting equations. After introducing 
some new functions, the result is 











Š Z IE 
PEE 9 ccc 0 Ps E haga e ] ? (8164) 
dr 2(x,u.)*? F 2(a a .)* r 
and 
= IF 
OTs ek ee ee | NL E dud 1 1 (8.165) 
dr 2(a,a_)* r 2(a,a_)? r 
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where 
F,(r) = (a XF(r) — («.)*G(r) (8.166) 
and 
Fy(r) = (a *F(r) + (x.)*G(r). (8.167) 
Suppose we define 
Za(a, — a_) ZaE 
= i = $ (8.168) 
Waa) Jma E 
2 
E T Zala, + n) Mam Zamc . (8.169) 
2(a,%_)? [mct — E? 
Zala, 4- a.) Zame? 
B. k 4 = k *, (8.170) 
2(a4.a_)* Jmc — E? 
and 
A= (a,a_)?. (8.171) 
Then Eqs. (8.164) and (8.165) become 
r4 e ^ d (r^^ e^ F,) = Peio (8.172) 
dr r 
and 
Aeir d A e) = Top. (8.173) 
dr r 


Having examined the form of solutions near r = œ and near r = 0, it now 
appears to be useful to substitute 


Fil) = e,p*e "I? f,(p) (8.174) 


and 
F,(r) = c;p* e^ 9? f, (p), (8.175) 


where p — 24r and s will assume some convenient value. With these 
substitutions, Eqs. (8.172) and (8.173) become 


1 a-sti d -A*s = 
Boe F (p^ ^**fi(p) = fP) (8.176) 
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and 
C2 
B_c, 


-~A-st+1 


d 
vM p^ e fP) = filo). (8.177) 
p 


If we substitute the left-hand side of Eq. (8.176) into Eq. (8.177) and carry 
out the obvious calculations, we get 








d’ fi H m fi 
Pd + [Qs FU — (s — AM, = (A^ + B, B. uo . (8178) 
p? 
A similar calculation for f,, gives us 
d? 
Pj 4 +[(2s + 1)— A G4 t0Df-2(7-B,B. -b 
p 
(8.179) 
It now becomes convenient to adjust s so that 
s = (A? + B, B È = (k? — Z^a?y.. (8.180) 


(Of course this is the same value for s that was obtained in Eq. (8.163).) 
Our two equations then become 


d? f, 
dp? 








-n t -G- Af - 0. (8.181) 





and 


d? 
pLi- gt- As 0f 8189 
dp? dp 


These two equations are known as special examples of Kummer's differential 
equation. In its general form, Kummer's differential equation is written as 


Ot cay ay aa (8.183) 
= — z} — — aw = : 
dz dz 


Not suprisingly, one of the solutions to this equation is known as Kummer's 
function. It is also known as a confluent hypergeometric function. In particular 
this solution is denoted by , Fj(a; c; z), that is 





Wi) = ‚F(a; c; z) = » zu zi (8.184) 
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where 


(a)g = 1 and (a), 2 a(a + D(a -2)...(a--n—1) forn>0. 


It is understood that c is not equal to a negative integer. An alternate solution 
of Kummer's differential equation can also be written in terms of a Kummer 
function. In particular 


waó(z) = z!^*,F(a—c-4- 1;2 — c;z). (8.185) 


For our purposes, we can only consider solutions which are behaved well 
enough near r = 0 so that 


[> [FO + (G()?] dr < +00. (8.186) 
0 


This condition is restrictive enough to eliminate the alternate solutions. The 
same condition is also quite restrictive on the specific form of the solutions 
that appear in Eq. (8.184). If Re z > + œ, then (see Magnus, Oberhettinger 
and Soni 1966, p. 289): 


I(c) 
F(a; c; z) ~ ——ez-. 
1 1¢ ) T(a 


This means that for most values of a, Kummer’s function blows up too fast 
near infinity for the integral in Eq. (8.186) to converge. In order for the 
integral to converge, it is necessary for a to be either 0 or some negative 
integer. In this circumstance, Kummer’s function reduces to a polynomial 
and the wave function is suitably well behaved at infinity. The require- 
ment that a be a nonpositive integer determines the energy eigenvalues 
for hydrogen-like atoms. Comparing Eq. (8.181) with Eq. (8.183), we note 
that 


a=s—A=-—n. 
Applying Eq. (8.168), we have 
ZaE = (s + n)m?c* — E?)?. (8.187) 


Squaring both sides of Eq. (8.187) and solving for E? gives us 


Z2? 2 —Í 
E? = veli + z ;| 
(s + n) 
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From Eq. (8.187), it is clear that we must take the positive square root. Thus 


2,2 -+4 
Ez nel 4 ae 5 | (8.188) 
S n 





where s = (k? — Z?a)? and n = 0, ,2,.... 

Except for a small correction known as the Lamb shift, these eigenvalues 
agree very well with experiment. 

Now let us return to the computation of the eigenfunctions. Since 
A — s =n, Eqs. (8.181) and (8.117) become 








p IO + [Qs + 1) — p] A + nf, =0 (8.189) 
dp? dp 
and 
T IEEE: df; +(n—1)f, 20 (8.190) 


The polynomials which satisfy these slightly specialized versions of 
Kummer's equation are known as generalized Laguerre polynomials. In 
particular 








file) = Lp) = p ^— ptem] (8.191) 
for n = 0, 1, 2,3,..., and 
e? d"^! d _ 
Jap) = LE (p) = ET dpi [e ^p"-!*?5] (8.192) 


for n = 1,2,3,.... 

To determine F(r) and G(r), we must determine F,(r) and F,(r). To do 
this we must determine the constants c, and c, that appear in Eqs. (8.174) 
and (8.175). To do this, we note that sine A — s =n, 


Cy +1 d (p "LQ9(p)) £ L9 (p). (8.193) 
Bc dp 


From Eq. (8.191), it is clear that the lowest order term in the polynomial 
LE p) is the constant T(n + 2s + 1)/(n!T(2s + 1)). Furthermore, the lowest 
order term in the polynomial L0*.(p) is the constant 


T(n + 2s)/[(n — DIT Qs + 1)]. 
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(Consistent with this constant, I will use the convention that 
fap) = LẸ? (o) = 0 


when n — 0.) Using these values to get the lowest order terms on both sides 
of Eq. (8.193), we have 








=c, I(nt2st1) — I(n + 2s) 
B,c,(n— DITQs- 1) (n— DITQs + 1) 
Or 
+ 2s) 2 1. 
Bic, (n s) 


From this, it can be shown that 


€, = uB, = u(k + [(s +n)? + Za?]*) = ulk + [k? + 2sn + n?]*) (8.194) 
and 
€; = — u(n + 2s) (8.195) 


where u is some appropriate normalizing constant. 
To summarize, we have 


(a4)? 











F(r) = 2; (Fi(r) + F,(r)), (8.196) 
ey? 
G(r) = EUN (A) — Air), (8.197) 
F,(r) = cap e"? L25(p), (8.198) 
F,(r) = c;p* e "^ LEO (p), (8.199) 
p = 2dr = 2(x ,a )*r, (8.200) 
mc? + E mc? — E 
a. = g and a. =- 
hc hc 
If (E (F(r + G(r dr = 1, then 

4h 
(61)? + (5) = ass (8.201) 

mc 


To compare the energy levels with those computed for the nonrelativistic 
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Schródinger equation, it is necessary to expand the right-hand side of 


Eq. (8.188) in powers of the fine structure constant. If one identifies the 
total quantum number n, of the Schródinger theory with |k| -- n, then one 


gets 
2.2 4.4 
Esme’ ZU n (s J (8.202) 
2n) 2(n)NWk| 4 


The first term on the right side of Eq. (8.202) represents the rest energy of 
the electron. The second term is 





mcZ^uw mZ^e* 
2(n? 8h (ny 





This term gives us the Schródinger energy levels. The third term, which must 
be regarded as a relativistic correction, gives the fine structure splitting of 
the Schródinger energy levels. 

Now let us consider the problem of determining the product wy" so that 
we can read off the current and dipole densities. From Eq. (8.153) 


(ei y ass. 
s 


Since f" = —5,, 


wv = (i0 - ij, O) ago FO ag O), (8.203) 
r r 


r 


From Eqs. (8.143) and (8.144), 








1 
Days) = ipm Hee cm) + E e SO |a eo. 


The matrix 75, commutes with 7,, but ?,, does not. However, ?4,0 + fo) = 
rao. + Po) = Poor + fo), and forg does commute with 7,. Thus Eq. (8.203) 
becomes 


yy? pibe ifos 5 C18) + sigah) -tr Pog = 


r r 
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or 





Yor 
r? r2? 


wt = 8) ua C99 ug T 
r? 





1 i i 
e | ivo) + E : C"(0) + sign(I) =e Mor. s sno (8.204) 


According the Eq. (8.19), the components of the 4-current are determined 
by the equation 


= elpy )o. 


If we normalize the Clifford wave function so that fẹ (F? + G?) dr = 1, then 
the factor of 1 disappears and we have 


J9 =e UO E GE N' (0), (8.205) 
r 
J' =J°=0, (8.206) 
and 
DU) 


J? — sign(k)e S70). (8.207) 


For the dipole moments, we note that from Eq. (8.28) we have 
MP = ÈE gy) 
2mc 


Using the same normalization that we used for the 4-current, we have 


M” = eh EUN Nn), (8.208) 
2mc r? 

eh (FOP — (GM) 

2mc r^ sin 0 





M% a — C"(0), (8.209) 


eh -£ ox + (G0)? 


M? —sign(k) = Y 
sin 





$7(0), (8.210) 


and 


M® = M99 =M” — 0. (8.211) 
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Problem 8.26. Show that the polynomial defined by Eq. (8.191) can also be 
written in the form 


Tb id p (8.212) 


n—m/m! 


L2%p) = Y (— »( 
m=0 


where 





amano 1)... (m + 2s + 1) 

n—m (n — m)! ` 

Problem 8.27. Use Eq. (8.212) to show that the generalized Laguerre 
polynomial is a solution to Eq. (8.189). 


Problem 8.28. (The results of this problem show how the nonrelativistic 
solutions of Schrödinger’s equation for the hydrogen atom can be obtained 
as limits of the Dirac solutions.) Suppose the normalization constant that 
appears in Eqs. (8.194) and (8.195) is adjusted so it is consistent with Eq. 
(8.201). Consider the consequence of then taking the limit c > œ and 
permitting the fine structure constant to go to zero. 


(1) Show lim. G(r) = 0. 
(2) For k > 0, show that 


F 
tim £O) — const x p~! e? [L p) — L2%(p)] 


crm ow r 


where p = [2Zagr/(k + n)] and ay is the Bohr radius, that is, 
Ay = Anh? [me?. 
(3) For k « 0, show 


F 
lim iom const. x gl* ^! e^ »2 [nL p) — (2|k| + n) L2!(p)], 


c> W r 


where p = (2Zagr)/(|k| + n). 

(4) From the material covered in Section 8.4, it is clear that for k > 0, 
k=j+4=141. Also according to Magnus, Oberhettinger, and 
Soni (1966, p. 241), we have 


Lg) = LE (x) — LUPO). 
Use this relation to show that for k > 0, 


lim Eo 


cw F 


= const. x p! e ^2 [O! Yp). 
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(5) From Section 84, it is clear that for k < 0, |k| = j + 1 = 1. Also from 
Magnus, Oberhettinger, and Soni (1966, p. 241) 


xLg* Dx) = (n + q + DLP) — (n + DLR (x). 


Use this relation along with the result of part (3) to show that for 
k <0, 


lim W const. x p' e"? LE! H(p). 

exa T 
Note: the results of Problem 8.28 enable one to associate the solutions of 
Dirac’s equation with the solutions of the nonrelativistic Schrödinger 
equation. From the discussion immediately preceding Eq. (8.202), one sees 
that the total quantum number n, that appears in the Schródinger model 
may be identified with the sum |k| + n. Thus for k 20, n=l + 1 - n and 
we may identify n with Schródinger's radial quantum number n, since 
n, — h, T | -- 1. On the other hand, for k <0, n, — 1-4 n and n must be 
identified with the sum n, + 1. In either case, we see that 


L. F 
lim EO. const. x pe ^? LE +t Dp) 


coo F 
is the radially dependent factor in Schródinger's wave function. 


Problem 8.29. Show that Eq. (8.204) can be recast in the form 





N(F? — G? 
yy a E ae 
r 

where 

1 F? + G? j, 2FG 2F 

mo s( rf = 16 2 J of ) 

2 F^—G? rsin0(F? —G?)N (F? — G*)N 

and 





1 F? + G? p i 
P2 = E nd X( Hie (F2 + G?)N — * 2FGS 
2 A r sin 8 r 





re a (. te gE * ON + izres ) | 
where 
N = N7(0), S = sign(k) S7 (0), C = CFO), 
and 


A = (F? + G2.N? — AF? G?S?. 
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Note: P, and P,, are commuting projection operators. Aside from a 
normalization factor, Po is the result of applying a duality rotation and a 
boost operator in the $-direction to the projection operator 4(I + fo). Piz 
is the result of applying a rotation operator and the same boost operator to 
the projection operator 4(I + i43). 


9 


THE KERR METRIC BY AN ELEMENTARY 
BRUTE FORCE METHOD 


9.1 The Kerr Metric 


As already observed, Schwarzschild constructed a spherically symmetric 
solution to Einstein's system of field equations in 1916. However, the exact 
solution for a cylindrically symmetric rotating body was not discovered until 
1963 by Roy P. Kerr. This solution has since become the focus of almost all 
mathematical investigations into the nature of black holes. 

Professor Kerr (1963) used the formalism of spinors to obtain his 
solution. It would be quite awkward to use the formalism of differential forms 
to obtain the solution. However, Clifford algebra provides an alternative 
computational tool which is both powerful and elementary. 

Perhaps the most elementary derivation of the Kerr metric that has been 
devised is that which appears in Introduction to General Relativity by Adler, 
Bazin, and Schiffer (1975). Those authors were able to construct the Kerr 
solution of Einstein's field equations by a very skillful use of vector algebra. 
By using Clifford algebra, it is possible to streamline some of their computa- 
tions. The use of Fock—Ivanenko 2-vectors is particularly helpful in reducing 
Einstein's field equations to Eq. (9.34) below. 

The Kerr metric is a degenerate metric that can be characterized by the 
equation: 


ds — y, dx* (9.1) 
where 


^ 


Va = Ya — mW, wj. (9.2) 


In this chapter, the units are chosen so that c — 1. Because of the nature 
of Eq. (9.2), we will abandon our convention that Greek indices be restricted 
to coordinate systems of Dirac matrices. However, we will continue to use 
uncapped gammas for coordinate frames and capped gammas for an 
orthonormal noncoordinate frame. In particular 


Í Pr + 949; = 2ny 
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where 


Noo = 1, Nyy = Mg, =N33= l, and ny =O ifjsk. 


It should also be understood that m is an arbitrary constant which will 
eventually be interpreted to be the geometric mass of the source and the w,’s 
represent the components of a null vector. 

It is not difficult to show that 


Jt = $* + mw*w, Y, (9.3) 

gj = Aij, — 2mw,W,, (9.4) 
and 

g^* = n* + 2mw!w*. (9.5) 


Since w/w, = 0, it is clear that 
wy, = wi, — mw;w*3,) = wj. (9.6) 
For much the same reason 
w,y! = wj. (9.7) 


The Fock-Ivanenko 2-vectors have a particularly simple form: 


m „ô 
T, zx ED! y" Bat 99 (9.8) 


(See Problem 5.23.) 
From Eq. (5.71), a formula for the curvature 2-form in a coordinate 
frame is 


190, = VI; — V,T, + E, T; — E,T,. (9.9) 


For this computation, it is useful to take advantage of Eq. (5.44). In 
particular 


a 
VI, = 7T, - ET, + EE. (9.10) 


b red 


With this result, Eq. (9.9) becomes 


1906, = ur — VT,- (9.11) 
OÓx* 
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From Problem (5.14), we know that Einstein's field equations for a 
vacuum may be written in the form: 


LR, — 0. (9.12) 


This means that we are faced with solving the equation: 


a 
zals- V0. (9.13) 





y? 


To compute the first term in Eq. (9.13), we must determine the formula for 
Y, in the appropriate noncoordinate frame. We first note that 


m ô 
T; ——" — (ww, 
: 2! ox 0n ) 


m i Q 
ol (nu "yo gi av vw, ok 
5 CPt + mw^w,f) ^ GF mw?) = Qui) 


E (f + mw'w,2? + mww, 
Ó 
+ m?w!w*w,w,$*) 2 (wgw,) (9.14) 


Because of the symmetry of the dummy indices w;w,$^* = 0. 


Furthermore 
v v Ó 6 y w° 
á ox" cM axi Uh )=w LIEN 
Ow? é $ o 9 
= Wa —— = — (waw^) — w* — w 
° ox" Ax" ew ox" ° 
=0-—w? 2 Wo 
ox" 
Therefore 
à 
w' —— w, — 0. (9.15) 
Ox" 


It then follows that 


ð Q Ó 
mw"'w,$'* Bo (wgw,) = mir ( ww, ee + waw” — J = 0. 
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With these results, Eq. (9.14) becomes 


a 
pc Ea 0" ww wm p "wj! (gy) (9.16) 


If we define 


Ó 
Uy = w" 3 Wk (9.17) 


then it is possible to recast the second term on the right-hand side of Eq. 
(9.16). That is 


ô m? m. 
= — (wgw,) — — $"w;wgv, — ar 9" w;W,Ug (9.18) 


m 
p 2 "n 
put 2 


Ox 


Because of the symmetry of the dummy indices in the last term on the 
right-hand side of Eq. (9.18), that term is zero. We now have 


m ô m . 

T; = ——$""—— (wgw,) — — $"w;Wgv, 9.19 

B 2 ? 3s B ) 2 y" W;Wg ( ) 
We are now faced with obtaining a simple expression for 


Q 
B r= 40 : 
axe ? 0 ax ? 





(9.20) 








y 


If we now combine Eq. (9.19) with Eq. (9.20) and use the result .of Eq. 
(9.15) to eliminate the m? term, we have 








ô m e 
) p pp ww, 
Pe f 34 e p^) 
2. 
WW Quy 
IE w : 
5.5. Mew P 2s ow gw») 
m? cd 
m CA per (wjwgv,). (9.21) 


We note that 25" = 4fmv + m3» — gBvgm, 
For this reason both 3-vector and 1-vector terms appear in y/(0/0x")Y;. 
However, because of the symmetry of the dummy indices, it is obvious that 
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two of these 3-vector terms are zero. The remaining 3-vector term is 


2 2 2 e 


m ô m^, 
— = gww — — — (ww) = —— f" wow, w! — — — ws 
2 2 Ox* Ox" 


Ox* Ox" 
+ (terms which are zero either because 


w’w, = 0 or because of Eq. (9.15) 


The remaining 3-vector term is then also zero because of the symmetry of 
the 0 and v indices. Writing down the 1-vector terms now gives us 


2 














0 m fa] 
BI T, = (nbn — pesn 
y axe? 3 (n9 ? ss ax" (wgw,) 
nts e? 
— — (n9'$* — n*"$") ww? Wew, 
2 ? J") wg Aue aya ve ) 
Was ict 
2 (nPI — n$) ae (w,wgv,). (9.22) 


When we use the n’*’s to raise indices, some terms are eliminated simply 
because w*w, — 0. Other terms disappear because, from the definition of v, 
in Eq. (9.17), it is clear that w^v, = 0. Finally a consequence of Eq. (9.15) is 
that 


2 


jw'w? 
Óx* Ox" 


(wgw,) = 0. 


Applying these eliminations, Eq. (9.22) becomes 











a m 0? =| e? | 
E— T, = —_| 7” ™w,) | — — | ?"w'w? J|. (923 
Pee S | axe axr ] 2p s eo p 


We now turn to the task of obtaining a reasonably simple expression 
for VI. We first note that 


m ô 
I; = 73 am ax" (wgw,) 


m 
= T$. y" ^ y' V.(wgw,) 


m m 
=o y" ^ Vy" wgw,) + 5 y! ^ La y we Wy 


=F d(y"wgw,) + 0. 
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Thus it follows that 
m 
dr, = e dd(;"w,w,) = 0. (9.24) 


In turn, this implies that 
= (ô + dT, = ôT, (9.25) 


This means that if we can express T, in the form T, = 3E/^^y,, then we can 
use Eq. (7.17) to get 


or, “Tek = al Ej (9.26) 


(It should be noted that a review of the derivation of Eq. (7.17) reveals that 
it remains valid when the coefficients of the p-vectors have some unsummed 
indices.) 

To use Eq. (9.26) we need to know the determinant of g,,. We note that 
Jap = Pap — 2MW, Wp. Furthermore by a spatial rotation, we can have w, = wo 
and w, = w, = 0. Thus 


1 — 2m(w,)” —2m(w? 0 0 
—2m(w? -—1—2m(w)? 0 0 

g — det z—]. 
0 0 -1 0 
0 0 0 -1 


Now 


m Ó 
T, = ——-y"—— (w a Wy 
21 aya Waly) 


m ô 
eee 40 vp w w, 
z 109" 9” 5 Way) 
= P sug oi + 2mw wt) (waw) 
2 ox" 
m Ó 
Ede d w wÊ 
2 Too agi! aw?) 


m 6 
= —— yog(n" + 2mw"w*) — (ww? 
2 Yool Me ) 
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or 
m 


T,= 
2 


à 
Vopn” aa (waw?) — m^yg, w^[v,w? + w,v*]. 
Since jo, w^v,w? = 0 and y4, A9 = 1y4,49? — A9*), we have 
m ð 0 
r= = n"? — (w,w?) — n"? — waw”) 
4 «| rer aw’) aa ) 
m2 
-5 Yo [w°v?w, — woo w, ]. 
We can now use Eq. (9.26) to get 


m e? 8? 
ôT, = —— y, | n” ww? n waw? | 
2 "| ddp OE e 





—-my, E (wfw, v?) — a (veo 





m, . à? 
eS ($,— nw] tout — n” au art w^ 
= m(p — mw w$ ) ô (w^w v?) — NE (w?w v^) (9 27) 
Yo oW Vk Ox? x ax? x i ` 


We note that $,L1?(w,w*) = 3, 1? (w,n'^w,) = $"U1?(w,w,). Using this and 
similar manipulations, along with a slightly modified version of Eq. (9.15), 
Eq. (9.27) becomes 








m Ü* 
ar, = —" p| P9) - Lv 
a 3! | (w,w,) ox" 5x0 | | 
2 e? 
t > jw, Don) -w A dy Tl 


7% map| (w?w,v, —W, wah) | 
x 


+ Zo 5 evang i (9.28) 


Now we wish to combine Eqs. (9.23) and (9.28) to get Einstein's field 
equations: 


à 
=a I» — 9T, = 0. (9.29) 





y? 
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Since m is an arbitrary constant, we will set the coefficients of m, m?, and 


m? separately equal to zero. However, there is no m? term in y^ 0T,/óx*. 
Thus the m? coefficient in ôT, must be zero. That is 


ô 
0-2w' Ax zs (WWaty) 


ô 0 
= B 
= we (wiw,) + w"wPw, ~ 5 Py 


əx Ayh 


Ó 
zt B Y 
0+ www z Uv 


0 
= ww, 5 (w’v,) — w^w,v, mm (w") 
= 0 — w,(v,v”). 


Thus v is a null vector which is orthogonal to the null vector w. Generally 
two null vectors which are orthogonal to one another must be scalar 
multiples of one another. To show this we note that if v; = — Awọ, then 


y = —A(Wo, Wy, W2, W3) + (0,04, Uz, U3) = — Aw + u. (9.30) 
Since 0 = (v, wò = — A(w, wò + <u, w), it follows that <u, wẹ = 0. Also 
<v, v? = A?(w, wò — 2ACu, wò + <u, u» = 0. 
This in turn implies that 


0 = <u, u) = n"uu; = —[(uy)? + (u2)? + (u3)°], 





or u = 0. Thus from Eq. (9.30), we now have 


0 
wÊ Ox? Wg — + Ug = — AWg. (9.31) 


Now we introduce one more definition, namely 


2d icum (9.32) 


Using Eqs. (9.31) and (9.32), we can simplify Eqs. (9.23) and (9.28). We 
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then have 
ð m ô 
6 p-—$ L+ Aw, 
y axe? 2! Ex )w | 
m, ô ô dw? Q 
a E ss ax (pw) - "a axe w| 
om ô 
= L+A 
mu IP (( wy | 
2 
room (— Awgw, — ~ Any) |; 
and thus 
0 m ô 
B Tl, =— 9” L+A | 9.33 
Waele GEK )w,) (9.33) 


Applying the same procedures to Eq. (9.28), we get 
m, " [7 
or, a ?" E] (w,w,) t= (L + A)w,) 
2 Ox" 
m^, s n) 
+ Es $?wy| wE ww) + w" ERE ((L + A)w,) |. 
x 


Combining this result with Eq. (9.33), Einstein's field equations become 


ð 
5— T,—61,-0 
axe " j 




















Av 2 
= — | Wy W. 
2! (wa 


m? , 
an ep" 


Since the coefficient of m — 0, we finally have 


j] 





ə] 














Iw w,) E EAW) 
Ox” 





,) — 0. (9.34) 
Since 


w” E ((L + A)w,) =0 
Ox" 
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it follows that the coefficient of m? is zero whenever the coefficient of m is 
zero. 

If we now require that the metric tensor be time independent, the 
equations become substantially more tractable. To pursue this, it is useful 
to introduce a three component space vector 4 by the equation: 


(Wo, Wis W2; W3) = WoL, Ay, 42, 43). (9.352) 


In the immediately ensuing calculations, Latin indices will be restricted to 
1, 2, and 3. With this convention 


<A, A> = nV, = —[G4)? + (22)?  (] = - 1. (9.35b) 


More importantly, Eq. (9.34) becomes 


2 


Ox' 0x? 














(Wo)? = n" (Wo)? = V2(wWo)? = 0. (9.36a) 


V2((Wo)24,) = x. (L + A)wo), (9.36b) 
and 


V?((wo)^ 4, An) = Ld ((L + A)WoA,) — us ((L + A)woA,). (9.36c) 
ox" Ox" 


It should be noted that because of the metric that we are using, 


Ee 
ax* (x+)? (dx?)?’ (@x3)? 


Expanding the right-hand side of Eq. (9.36c), we have 

















04. ô 
V (Wo) 4, 44) = —— (L + A)wo — An —— (CE + A)wo) 
ôx ôx 
ô Q 
— An (L + A)Wo — Àm ((L + A)wo). (9.37) 
Ox" ox" 


Using Eq. (9.36b) to eliminate the second and fourth terms on the right-hand 
side of Eq. (9.37), we obtain 


(2 Am 








+ 


ax cel + A)Wo = —V?[(Wo)*AmAn] 


+ Ap V CWO) Am) + AmV"((Wo)?An)- (9-38) 
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Expanding each of the terms on the right-hand side of Eq. (9.38), we get 
several cancellations. Several of the remaining terms disappear because of 
Eq. (9.36a). The resulting equation is 








Ohn 4 Ohm —2Wo DÀ Oy Y Oy Oe (9.39) 
Ox" Ox" L+A  Ox'Ox p ox dx! 
where 
L+A 
R ee (9.40) 
2Wo 


Equation (9.39) relates the derivatives 04,/0x" to an orthogonal transforma- 
tion. If we define 








-n duse 
n= hH- E, (9.41) 
p ôx 
we note that 
1 6A, 1,,02; 
ie, = $m $43 j 
Q1,» e A Pao 2 ? 22) 
1 0A, 104; 1 OA, OA; 
zm), + 6" -—— no. oo pgm Uk 7^ 
Pas I) + 93 póx" *pôx" p?  Ox"óx" 
= Pre Pp. (9.42) 


From this result, we see that the Clifford numbers (7,, 7, 73) can be 
obtained from the Clifford numbers (f4, 2, f3) by an orthogonal transforma- 
tion—that is by a reflection or a rotation. It can be shown that the vector 
4 = Aĵ; is invariant under this transformation. 

If we use T to designate this transformation, then 7(};) = 7; and 


DUCUM 
TA) = JT) = 15; = 9, + - 9" d (9.43) 
p x 


Repeating the argument used for Eq. (9.15), it can be shown that 


A Phi =0. (9.44) 
ôx” 


Thus Eq. (9.43) becomes 7(4) = 47$; = 4. 

It is now clear that our transformation is either a rotation where 4 is 
the axis of rotation or a reflection with respect to some plane containing the 
vector 4. Useful results follow if we assume it is a rotation. The rotation 
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operator can be immediately written in the form 
č l5 34^ : é 
ANO c PO EU Pas + 27931 + 2033) ong 
=Ic 


ó 
pem k^ d 
JO i) sin 5 


= I cos 





dis 
"3 
eee č (9.45) 
2 2 


where J = 9493. 
Using the fact that J commutes with all Dirac matrices and J? = I, we 
have 





— 9" 
SN p ox 
= WEIR E) 
=| Lcos A JA sin pl: coss + JA sin : 
2 2 2 2 
7 RON TUNE NUES. 
=f, s^» + J(fjà — 49) sin; cos — Af, sin’ 5. (9.46) 


To simplify this expression, we observe that 
Ap, A = AU 3,4. = AUF n d Ny Px mi Da), ae nk.) = 24,4 + 9, (9.47) 


Incorporating Eq. (9.47) into Eq. (9.46) and using the fact that 
2 sin €/2 cos £/2 = sin é and 2 sin? £/2 = 1 — cos č, we have 





Sl agian A 
DE "c fy cos č — AA — cos &) — JA ^ fy sin č 
or 
M ee tek TN 
fex dye OU F4: JA. (9.48) 


where « = p(1 — cos č) and f = p sin é. 

The approach to be used here is to chip away at Eq. (9.48) and eventually 
obtain an equation for a + if. We will then use the solution of that equation 
to obtain a formula for å. For the ensuing calculations, it is useful to note that 


$n = Pye (9.49) 


25525. (9.50) 
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and 


py, = 3I. (9.51) 
Furthermore if a is a 1-vector then 
MA ^ a) = <A, Aya — (4, ay) à = —a — (A, aà, (9.52) 


A(A ^ à) - (A ^ a)4 — 0, (9.53) 
and 
(A ^ aya ^ a) = <A, Aa, a» + (KA, aò)? 
= (a, a) + (<å, ay. (9.54) 
(See Problem 9.1.) 


Multiplying both sides of Eq. (9.48) by 4, using Eq. (9.52) and then 
projecting out the O-vectors, we have 


me " 
<å, 9" P = —a((4 5,» — å;) 
or 
A" Ohy =0. (9.55) 
ox" 


Multiplying Eq. (9.48) by $7, we have 


ji" Oh; = (ni™ ET gm) 94, 
hdi Ox" 
= óÀ — då 
= —a(3 + Ad) + B59, A 
= —2a + 2Jfà. 


Equating 0-vectors and then 2-vectors, we have 


eA" 
62 = — = — 2a, (9.56) 

ox" 

and 
oA "s 

dà = 9" — = —2Jf4, (9.57) 

Ox" 

02! j 

— 2d« = dó4 = j" pot NU (9.58) 





Ox" 0x Axi’ Ox". 
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From Eq. (9.48) 


2. 047 , ; gcn 
?" ax™ = —a($ + MA) + JB, 
x 


and thus 
Ou, on OA 
—2d = ps AR = — å +A 
á ox! PPRA (25 E) 
2,4, Of ^ oa OAK 
MCI A JBS LL. 9.59 
Pap tue PN 
From Eq. (9.55), it follows that 
je 
ox! 


Equation (9.59) thus becomes 
—2da = —da — <å, da» 4 — a(5A)A + J(dB ^ A) + JfdA. 


Using Eqs. (9.56) and (9.57) and then reorganizing the resulting terms 
gives us 


da = —2(a? — f?)4 + <A, da» À + JA ^ df). (9.60) 
By multiplying Eq. (9.60) by 4 and using Eq. (9.52), we get 
Ada = 2(a? — f?) — <A, da» — JdB — JA, df» A. 
Projecting out 0-vectors and then 2-vectors, we discover 
<A, da» = o? — f?, (9.61) 


and 


AA da = —Jdft — Fd, df» A. (9.62) 


Equation (9.61) can be used to revise Eq. (9.60) and we can multiply Eq. 
(9.62) by J to obtain an equation for df. We then have 


da = —(o? — f?)4 + JCA ^ df), (9.63) 
and 


df = —44,df» 4 — JA ^ da). (9.64) 
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To obtain an expression for (4, df», we go back to Eq. (9.57) and 
then get 


Vda = —2J(Vf)4 — 2JfVA. 
Projecting out the 3-vectors, one gets 


0 = —2J«df, 1» — 2F BSA 
Or 


<A, Mf = — 5A = 2af. (9.65) 


If we now replace <A, df» by 2af in Eq. (9.64) and then combine the 
resulting equation with Eq. (9.63), we have 


d(x + if) = —(x + i^A — iJ(4 ^ d(x + if)). (9.66) 
Combining Eqs. (9.61) and (9.65), we also have 
<A, d(x + iff)» = (a + iB". (9.67) 
From Eq. (9.66), we have 
Vd(a + if) = —2(u + ify(V(x + if) A — (x + iB VÀ 


: T^i Ó A 
—iJ9 ak P, t in| 


Projecting out the 0-vectors, we have 
ód(« + if) = —2(% + 1B)(4, d(a + if)» — (x + kon 


-ip 22, (a + ip) +å; (x + ip (9.68) 


Ox! dx* 3 gx! Qx* = 


Because of the symmetries of the indices, 


ijk 
Y 





zi Bos Grip 


Furthermore 


pik = i(jüg* +9 j^). 
Using these observations, along with the fact that 


<A, d(a + if)» = (a + if and 64 = —2a, 
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Eq. (9.68) becomes 
ód(a + if) = —2(x + iB? + 2(x + iBy)la 


ye ; Jt) d(x + if) + (d(x + if)) då]. 


Since dA = —2Jf4, this becomes 


dla + if) = —2(a + if)? + 2(a + if)?a 


+ if(Ad(a + if) + (d(x + if))A). (9.69) 
But 
[Ad(a + if) + (d(x + iB)) 4] = 2¢A, d(x + if)» 
= 2(a + if)’. 
Thus Eq. (9.69) finally becomes 
dd(a + if) = V?(x + if) = 0. (9.70) 


Equation (9.70) is Laplace’s equation which has a wide variety of 
solutions. However, if we square both sides of Eq. (9.66), we get 


(dla + if)? = (a + ip)*(a)? 
+ iJ[A(A ^ d(x + ifl)) + (A ^ d(a + if) (o + ip)? 
— PA ^ d(a + iB))?. 


Using Eqs. (9.53) and (9.54), this becomes 
(d(z + if)? = —(x + iB)* — (dla + if)? — (<A, d(@ + i)". 


Using Eq. (9.67) one more time and reorganizing the resulting terms in this 
last equation, we get 


(d(x + if)? = —(a + ip}. (9.71) 


Alternatively, this can be written as 


1 2 
Ee] Sat, (9.72) 


In the context of classical optics, this last equation is known as the 
eiconal (or eikonal) or characteristic equation. It severely restricts the 
solutions for Laplace's equation that can be used for our purposes. 
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Once « and f are determined, it is possible to determine A. To pursue 
this, it is useful to define 





aree (9.73) 
Dividing Eq. (9.67) by (a + iB)?, we get 
—44, d(x + i) !» «1 (9.74) 
or 
ido bs —1. (9.75) 


In much the same fashion, we can divide Eq. (9.66) by (x + if)? and 
obtain 


d(p + ic) = à — iJ(À ^ d(p + io)). 


Multiplying this last equation by 4 and then projecting out the real parts 
gives us 


Adp = —I + JA(À ^ do). 
From Eq. (9.52), this becomes 
Adp = —1 — Jdo — JA, do» 4. 


However, from Eq. (9.75), (4, do» = 0. Thus we have Adpdp = —(I + Jdo)dp 
Or 


ja ee eap: (9.76) 
(dp, dp» 


It remains for us to determine wọ in Eqs. (9.36a,b, and c). Since 
& = p(1 — cos č) and f = p sin é, we have 


a? + ff? = 2p?(1 — cos €) = 2pa. 
From Eq. (9.40) 


(L + A) wo = 2p(wo)? = (wo)*(a? + B?)/a. 
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Using this relation, Eqs. (9.36a, b, and c) become 
V?(wo)? = 0, (9.772) 


(Wo)? 





ô 
V?[(wo)? 44] = ule (a? + "|. (9.77) 


VT) Antal = = al" (0? + P, 








? (oe (x? + B2), | (9.776) 


x" 


We see that if (wo)? is a constant multiple of o, then Eq. (9.77a) is satisfied 
and Eqs. (9.77b and c) take on a simpler appearance. It can be shown that 
a constant multiple of « is the most general solution for (wy)?. We first note 
that we have solved Eq. (9.39). Reviewing the origin of this equation tells us 
that Eq. (9.77c) is solved once we have a compatible solution for Eq. (9.77b). 

To show that « is a solution for Eq. (9.77b), we need to show that 


V?[x4] = —d(o? + f?). (9.78) 


Using the fact that the Laplacian of « is zero, the left-hand side of Eq. (9.78) 
can be rewritten as 


V?(g4) = 2n E: a + aV?4. (9.79) 
x 


x* 
To get an expression for 04/0x', we note that from Eq. (9.48) 


dA, 04; a 
Ax! = (tut -i = —a(ng AA) — Qu IBAA 0)». — (9.80) 


To compute — (f, J TBA ^ $ ĵ;) J, we > project out the 0-vector from the product 
—JB$ A ^ ĵ;) and thereby get —Jpa* ju; = IF JBA* 9 ni- 

To obtain an expression for 04/0x', we need to multiply both sides of 
Eq. (9.80) by ff. In so doing, we should observe that an immediate 
consequence of Eq. (9.49) is that 


PTBA i) = IB "ju = TBA ^ Îi 
Thus 


= = —a(f; + A4) + FBO ^ $2, (9.81) 
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and therefore 
an A Oh oad + A2, day) + 2FB(A ^ do). 
Ox* Ox! 


Using Eqs. (9.61), (9.64) and (9.65), this becomes 


2nF T = = —d(a? + f?) — 2a(a? + f?)4. (9.82) 
x* Ax 





To get an expression for the second term on the right-hand side of Eq. 
(9.79), we note that 


aV?4 = a(ddd + ddA). (9.83) 
Since 64 = —2« and då = —2JBA, it follows that 


adóA = —2ada = —d(a?) (9.84) 
and 
adda = aV(—2Jf4) = —2aJ(VB)A — 20BJVA. 


Projecting out the 1-vectors in this last equation gives us 
aðdå = 20J(A ^ df) — 2afJdA 


Using Eq. (9.63) along with the fact that dà = —2Jf4, this last equation 
becomes 


ddA = d(x?) + 2a(à? + £?)4. (9.85) 
Adding Eqs. (9.84) and (9.85), we get 
aV?4 = 2a(x? + f?)4. (9.86) 


Substituting the results of Eqs. (9.82) and (9.86) into Eq. (9.79), we obtain 
Eq. (9.78)—the equation we set out to validate. 

To show that (wg)? must be a constant multiple of «, we substitute 
(wo)? = f« into Eqs. (9.77a and b) and then show df = 0. 

From Eq. (9.77a) 


V?(fa) = (V?f)a + anti OF Oa + fV7«=0 
Ox! Ox! 


or 


. Of Ox 
2 Jg o———— 
(V?f)a + 2n Sos 0. (9.87) 
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From Eq. (9.77b) 


V*(fad) = —d( f(a? + B^), 








or 
_ of a af aa 
(pan + 2n9 oF 9*4 4 ay Fy 95 V eiui) 
ox! x! Ox! Ox! 


= —(x  f?)df — fd? + p’). 
Applying Eq. (9.87) and the fact that V?(x4) = —d(a? + B?), this becomes 


2an" of u = —(a? + pdf. (9.88) 
Ox! Ox! 


Using Eq. (9.81) for 04/0x!, we get 
— 2x (df + ACA, df » + 2ufJ(A ^ df) = —(a? + pdf. (9.89) 
Multiplying both sides of this equation by 4, applying Eq. (9.52), and then 


projecting out first the O-vectors and then projecting out the 2-vectors, 
we get 


0 = —(a? + f?)(4, df» (9.90) 
and 
—22?(4 ^ df) — 2apJdf = —(a? + PA ^ df). (9.91) 


We now see that (A4, df > = 0. With this result, Eq. (9.89) becomes 
(à? — f?)df = 22fJ(A ^ df). (9.92) 


If we multiply Eq. (9.91) by J and reorganize the resulting terms, we arrive 
at a slightly different equation: 


2afdf = —(0? — B?)J(A ^ df). (9.93) 
If we take the obvious linear combination of Eqs. (9.92) and (9.93), we get 
[a — pY + 4a? f?]df = (X? + p?y'df — 0. 


and thus df = 0 which is our desired result. 
Now let us return to the problem of finding a solution for 


VA(x--if)-0 and  (d(p-io)?--1, 
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where 


ptic= 





a+ ip 
An obvious choice is 


1 


x + iß = 1/r = (x? + y? + 27)74. 


However, this choice leads to what is known as Eddington’s form of the 
Schwarzschild solution. The solution (x? + y? + (z — a) * would only 
correspond to a physical translation of the spherically symmetric solution. 
However, the solution 


a + iB = (x? + y? + (z ia) (9.94) 


does indeed correspond to a physically meaningful solution which is quite 
distinct from the Schwarzschild solution. 

This solution is most naturally expressed in oblate spheroidal coordinates 
(Margenau and Murphy 1956). To understand this coordinate system, 
consider a family of confocal ellipses and a companion family of confocal 
hyperbolas in the xz-plane where both families share the same focal points 
(+a, 0, 0). See Figs. 9.1 and 92. 

In this coordinate system: 


x = |a| cosh u sin 8 and z = |a| sinh u cos @. 


The members of the family of ellipses are specified by the value of u. In 
particular 


x? z? 





oa eae sin? 0 + cos? 0 = 1. (9.95) 


On the other hand, members of the family of hyperbolas are determined by 
the value of 6: 


x? z? 


= cosh? u — sinh? u = 1. (9.96) 





a? sin? 0 a? cos? 0 


If @ is used to specify the angle between the xz-plane and the plane 
passing through our point and the z-axis, then: 


x = |a| cosh u sin 0 cos $, (9.972) 
y = |a| cosh u sin @ sin $, (9.97b) 


z = |a| sinh u cos 8 (9.97c) 
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Z 





Fig. 9.1. For a fixed value of u, the equation “x?/(a? cosh? u) + z?/(a? sinh? u) = 1" defines a 
member of the family of ellipses with focal points at x = +a and z = 0. For a fixed value of 0, 
the equation "x?/(a? sin? 0) — z?/(a? cos? 8) = 1” defines a member of the family of hyperbolas 
with the same focal points. 


where 


O<u<+0,0<0<7, and 0 € $ « 2x. 


In three dimensions, surfaces of constant u are oblate spheroids, surfaces 
of constant @ are single sheeted hyperboloids, and surfaces of constant $ are 
planes through the z-axis. (See Fig. 9.3.) 

In this coordinate system 

x? + y? + (z — ia)? = a? cosh? u sin? 0 + (|a| sinh u cos 0 — ia)? 
= a? (sinh? u — cos? 0) — 2ia|a| sinh u cos 0 
= (|a| sinh u — ia cos 0)? 
= (p + io}. 
Thus 
p = Jal sinh u (9.982) 
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Cartesian Coordinates. (x, y, z) 


A = (0, 0, taisinh uo) 
x? z B = (iaicoshug, 0, 0) 
F = (iat, 0, 0) 
P = (iatcosh up Sino, 0, iat sinh ug cosóp) 
V+ =V-= (iaisingy, 0, 0) 


ie fane t 
2 2 2. D 
a^cosh ug a^sinh Up 


or u-ug 














2 
2 is De 
a^sin^6o a^cos^eg 
or 6 -65 
x 
z 
porque 
a^sin^0o a^cos^0o 
T 
Oblate Spheroidal Coordinates (u, 9, ¢) or ð= 2- 90 
A = (ug, 0, 0) 
B = (Up, 1/2, 0) 
F = (0, 1/2, 0) 
P = (ug. 8g. 0) 
V+ = (0, 09, 0) 


V- = (0, 1/2 - 6g. 0) 


Fig. 9.2. The coordinates of some geometrically significant points. 


and 
o = —acos b. (9.98b) 


To get the metric for the flat space, we note that 


S = Pix + fay + az 
= 9,/a| cosh u sin 0 cos $ + 9,\a| cosh u sin 0 sin $ 
+ $4la| sinh u cos 6 
and thus 


0 i : : ; : 
a= 5 = ja|[ĵ; sinh u sin 0 cos $ + 5, sinh u sin 0 sin $ + f, cosh u cos 0]. 
u 
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z 





P=( up: $9 $9) 


Fig. 9.3. The location of a point P 1s defined by the intersection of three surfaces: an oblate 
spheroid, a single sheeted hyperboloid, and a plane passing through the z-axis. 


$9 = lal[?, cosh u cos 0 cos $ + ?; cosh u cos 0 sin $ — $4 sinh u sin 0]. 
and 


$5 = la|[ —?, cosh u sin 0 sin $ + ?; cosh usin 0 cos $]. 


If we retain the(—, —, —) signature for the three space components then 


Qu Ju Ge Jip 1 0 0 0 
Gut Guu Guo Gud 0 —a(sinh? u+cos? 0) 0 0 
Go Jou Goo Jop fo 0 — a? (sinh? u + cos? 6) 0 
Iot Jou Goo Goo 0 0 0 — a? cosh? u-sin? 0 


(9.99) 
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To compute 


_ (+ Jdo)dp 
(dp, dp» 


> 


we note that 





é 0 ô 
dp =| y" — +y — +7? Je sinh u = y"Ja| cosh u. 
p (> rate y oó la| y”lal 


00 
Similarly 
do = d(—a cos 0) = y^a sin 0. 
Also 
(dp, dp) = g""a? cosh? u = — 
Furthermore 


j E TYuTo 79 
laj* (sinh? u + cos? @)(cosh usin 0) 





In addition 


F Jo doo 
Jdod = = $ 
: a(sinh? u + cos? 0) — a(sinh? u + cos? 0) i 





acosh? u-sin? 6, 
(sinh? u + cos? 0) ue 
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(9.100) 


(9.101) 


(9.102) 


(9.103) 


(9.104) 


(9.105) 


Putting together Eqs. (9.101), (9.103), and (9.105), Eq. (9.100) becomes 


u 





ja la|(sinh? u + cos? 0) 


a sin? 0. 
cosh u 


y 


To obtain an expression for wo, we note that 


1 1 
p+io |a|sinhu — ia cos 0 





a+ip= 


_ la| sinh u + ia cos 0 
a^ (sinh? u + cos? 0) 





Thus 


sinh u 
“= . 
|a\(sinh? u + cos? 8) 





(9.106) 
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Therefore 


lai(sinh? u + cos? 8) 


cosh u 





(Wo, Wus We, Wy) = «(1 ,0, —asin? 0), (9.107) 


where 





A A 
wo =I eid | (9.108) 


la|(sinh? u + cos? 0) 


The null vector field w lends itself to a simple geometric interpretation. 
To pursue this interpretation, it is useful to obtain the upper index 
components. In Cartesian coordinates, we were able to use the flat space 
metric and its inverse to lower and raise the indices of w. This property of 
w must carry over to alternate coordinate systems. Thus we can use the 
inverse of the flat space metric of Eq. (9.99) to raise the indices of w. Doing 
this, we get 





; 1 1 
(w°, w", w9, we) = ZU -, 0, J (9.109) 
la| coshu' ` a cosh? u 


A null curve following the flow of this vector field must satisfy the set 
of equations: 


dt 
— =f, 


du — —f d dó of 
ds ds | la| cosh u’ ds 


=0, and = — m, 
ds acosh?u 


(9.110) 


where f is any continuous positive definite function of the coordinates. 

Since d0/ds = 0, any given member of this family of null curves must 
adhere to the surface of the hyperboloid corresponding to that fixed value 
of 0. 

It is well known that a hyperboloid is a ruled surface. It was observed 
by Boyer and Lindquist (1967) that the flow lines of the w field are the 
rulings of the coordinate hyperboloids shown in Fig. 9.4. To substantiate 
this, we merely note that not only do the flow lines of the w field adhere to 
the surface of the coordinate hyperboloids but they are straight lines in the 
flat metric. (See Problem 9.2.) 

Pairs of rulings cross at the waist of each coordinate hyperboloid. (See 
Fig. 9.5.) Examining the signs of Eqs. (9.110), we see that if a is positive, the 
null geodesics correspond to the rulings shown in Fig. 9.3 with the future 
direction toward the source. If a is negative, the null geodesics correspond 
to the alternate set of rulings with the future direction also toward the source. 

One must be cautious about reading too much into these figures. What 
is a straight line in the flat space metric may not be a geodesic or "straight 
line" in the curved space of the Kerr metric. The w field corresponds to flow 
lines which are geodesics in both the flat space metric and the Kerr metric. 
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Fig. 9.4. Members of the null vector field w are straight lines on the surfaces of single sheeted 
hyperboloids. 


However, there is no reason to believe that the alternate set of hyperboloid 
rulings correspond to geodesics in the curved space. In Problem 9.4, it is 
shown that in the Kerr metric there is a family of geodesics which adhere 
to the surface of the coordinate hyperboloids but which are clearly not 
straight lines in the flat space metric. 

Now let us turn to the problem of establishing a physical interpretation 
for a. Combining the flat space metric of Eq. (9.99) with the computed 
components of w in Eq. (9.107) in the manner indicated by Eq. (9.4), we 
have the line element for the Kerr metric: 


(ds)? = (d£)? — a?(sinh? u + cos? 0) [(du)? + (d0)?] — a? cosh? u sin? 6(d$)? 
2m sinh u 
lal(sinh? u + cos? 0) 


M 2 2 
- [a 3 ja\(sinh* u + cos* 0) 
cosh u 





2 


(9.111) 





du — asin? 0 ao | 


In most presentations p = |a| sinh u instead of u is used to label a 
particular oblate spheroid. The variable p equals half the length of the axis 
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YY 


Fig. 9.5. The intersection of a single sheeted hyperboloid and a plane tangent to the hyperboloid 
at the waist is a pair of straight lines. 


of rotation for the labeled oblate spheroid. (Half the major axis of the rotated 
ellipse is ./p? + a°.) Using this parameter, the line element becomes: 





2 
(ds)? = (dt)? — (p? + a? cos? of d z+ (2d — (p? + a?) sin? 0(d$)? 
a rp 
2mp p? + a? cos? 0 2 i 
yen oe a oe —__________dp — asin? 6d . (9.112 
pra e»! pta E ? ( ) 


The line element of Eq. (9.111) has three off-diagonal terms: dt du, dt dọ, 
and du dọ. It is possible to eliminate dt du, and du d$ by a coordinate 
transformation. In particular, instead of measuring the angle of longitude @ 
from the xz-plane, it is useful to measure it from a surface through the z-axis 
whose location depends on u (or p). In a similar spirit it is useful to reset 
our clocks on each oblate spheroid. Doing this we have, 


t = Î+ Alu), (9.113a) 


$ = $ + B(u), (9.113b) 
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dt = dé + A'(u) du, (9.114) 
dọ = dd + B'(u) du. (9.114b) 


When one substitutes these formulas for dt and dọ into the line element 
of Eq. (9.111) and then sets the coefficients of dt du and d$ du equal to zero, 
one obtains two linear equations for A’(u) and B'(u). When these equations 
are solved, one gets 


2m|a| sinh u cosh u 








A'(u) du — : 
(la| cosh? u — 2m sinh u) 
2mp 

= ———————— d 9.115a 

p? +a? — 2mp i ( ) 
and 
inh 
B) dus 2ma sinh u 


~ a} cosh u(Ja| cosh? u — 2m sinh u) 


2map 
= _—_ dp. 9.115b 
(p! X ap? +a? — 2mp) ^ ort 


These equations can be integrated with respect to p by the method of 
partial fractions but it is a nuisance to do so because one must consider 
three cases: |a| < m, |a| = m, and |a| > m. However, to get the line element, 
it is unnecessary to carry out the integrations. 

Using Eqs. (9.113a and b) along with Eqs. (9.114a and b), one 
obtains 





T | 2m sinh u Je» 


la|(sinh? u + cos? 0) 


la|* (sinh? u + cos? 0) cosh? u 
(la| cosh? u — 2m sinh u) 





(du)? 
— a*(sinh? u + cos? 6)(d0Y? 


(do)? 





: 2|a|m sinh usin*0 |, , 
— [e cosh? u sin?0 + la| | 


sinh? u + cos? 0 


4ma sinhusin?@ |, ; 
la\(sinh? u + cos? 6) 





(9.116) 
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equivalently 
2mp 

ds)? = | 1 — ———— ———- Kdty? 

ten ( (p? + a? cos? ; P 

(p? + a? cos? 0) 


dp)? — (p? + a? cos? 0Y(d0)? 
p aum O (p* +a (dé) 





2a?mp sin* 0 


2 2 3A 
— + a^) sin^ 0 + ———— ——— 
[o ! p? + a? cos? 0 


[ase 


4amp sin? 0 a a 


Except for a sign discrepancy in the off-diagonal term, the coordinate 
system of Eq. (9.117) is one of several discussed in a paper by Robert Boyer 
and Richard Lindquist in 1967. Since that time, these parameters have 
become known as the Boyer—Lindquist coordinates. This coordinate system 
enables us to obtain a physical interpretation for a. 

In 1918, Von J. Lense and Hans Thirring published an approximate 
solution to Einstein's field equations for a slowly rotating spherical source 
with a weak field. Their computed line element was 


(ds)? = (1 à 2 Jap = (: + MN eax? + (dy)? + (dz)?] 
peepee, (9.118) 
r 


where k is the gravitational constant, M is the mass of the source, and J is 
the angular momentum of the source. 

The line element of Lense and Thirring can be obtained from Eq. (9.117) 
by approximation. We first expand the coefficients in Eq. (9.117) in 
powers of (a/p) and retain only the zero and first-order terms. Doing this 
we get: 








(ds)? = (1 ma (: 2) «py — p*((d0)? + sin? 6(d¢)?) 


in? 0.5. 
dun un d dd: (9.119) 
p 


We now substitute p =r + m and then expand the terms of the resulting 
line element in powers of m/r. If we retain only the zero and first-order terms, 
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we have 
(ds)? — ( — i)a — ( + "can? + r?(d0)? + r? sin? e(dóy^] 
r r 
4am . AS lon 
+ —— sin? 0 dọ dé. (9.120) 
r 


Comparing this equation with that of Lense and Thirring, we see that they 
match if we insist on two identifications. One is the usual one of matching 
kM with the geometric mass m. The other one is 


Ma - J. (9.121) 


Thus “a” may be interpreted as the angular momentum of the source 
per unit mass of the source. 

It should be noted that there is a certain amount of arbitrariness in the 
sign convention. Had we chosen to use the complex conjugate on the 
right-hand side of Eq. (9.94) for the solution of Laplace's equation, we would 
have ended up with negative signs in the off-diagomal terms in Eqs. (9.116) 
and (9.117). In turn, this would lead to the interpretation of “a” as minus 
the angular momentum per unit mass. 

The sign convention used in this text is that used by Subrahmanyan 
Chandrasekhar in his text, The Mathematical Theory of Black Holes (1983). 
However, it is opposite to that originally used by Roy Kerr (1963). 


Problem 9.1. Verify Eq. (9.52). Hint: write in component form and use the 
fact that yy” = y¥* + g¥yk — g'*4J. Also verify Eqs. (9.53) and (9.54). 


Problem 9.2. Carry out a variation calculation on 


dx' dx/ 
— 2mw,w ) — —— d 
fos mw;w,) ER S 


to obtain the set of equations for the geodesics for the Kerr metric. Then 
show that if 





the system reduces to 
d 
— (fw,) = 0. (9.122) 
ds 


Show that this equation is satisfied if df/ds = Af?. (You may wish to 
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use Eq. (9.30).) Note: Eq. (9.122) may also be written as 


du s 
—x*=0. 

ds? 
This means that the null geodesics that follow the flow of the w field are 
straight lines in the flat metric. 


Problem 9.3. Show that aside from members of the w field, a vector which 
is null in the flat metric is not null in the curved space of the Kerr metric. 


Problem 9.4. The w field is known as a “principal null congruence.” The 
word “congruence” is used to indicate that the w field generates a space 
filling system of curves which do not intersect one another. The word 
“principal” is associated with the role that w plays in the structure of the 
metric. From symmetry arguments, it was known by Kerr that a second 
principal congruence of null geodesics had to exist. 

One might suspect that this alternate set of geodesics might be the 
alternate set of rulings for the single sheeted hyperboloids. However, this is 
not quite true. For one thing, although these rulings are straight lines in the 
flat metric, we have no reason to believe they are geodesics in the curved 
metric. 

To determine this alternate congruence, we can take advantage of a 
symmetry in the Kerr metric of Eq. (9.117) expressed in the coordinates of 
Boyer and Lindquist. It is invariant under the replacement of dp by —dp. 
(Alternatively we can change the signs of both dó and d£) This means that 
if we obtain the upper index components of w in the Boyer-Lindquist 
coordinate system, one can change the sign of the w, component to obtain 
an alternate null congruence. 


(1) Use Eqs. (9.109), (9.114a and b) along with (9.115a and b) to 
determine that for the Boyer-Lindquist coordinate system: 


p^ +a? a 
ee geb lec ee 
p* t a^ — 2mp p^ + a^ — 2mp 





(wi, w^, w°, w?) = zl 


(2) Let k = (wi, w^, —w?, wê) and then use Eqs. (9.114a and b) along 
with (9.115a and b) to show that in the coordinate system of Eq. 
(9.112) 


k = (kf, k?, k®, k*) 


i E +a? + 2mp a(p? + a? + 2mp) | 
= x* 5 > > d 
p? + a? — 2mp (p? + a?)(p? + a? — 2mp) 








This means that for the coordinate system of Eq. (9.111), where 
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p = Jaj sinh u: 


k= qnasi = ot? ene oe ) 
Q la|coshu' | Qa cosh? u 





where Pla| cosh? u + 2m sinh u and Q = |a| cosh? u — 2m sinh u. 
Compare this last equation with Eq. (9.109) and show that when 
u — +0, or m > 0, the flow lines, represented by the vector field 
k, asymptotically approach the alternate set of hyperboloid rulings 
with an outgoing direction. 

Students of black holes are usually more interested in the 
behavior of the null vector field near the source. If one approaches 
a source for which m z |a| then one encounters an apparent sing- 
ularity on the oblate spheroid defined by the equation p =m + 
./m? — a? where p? + a? — 2mp = 0. This is a coordinate singular- 
ity that can be removed by simply factoring out (p? + a? — 2mp) ! 
and renormalizing the vector field k. Nonetheless, the surface of this 
oblate spheroid is physically interesting. 

For one thing, a sign reversal occurs when one crosses the 
surface. Outside the surface, k represents an outgoing field of null 
geodesics. Just inside the surface, the field becomes ingoing. 

Show that when p > m + J/m? — a?, the vector field (K', k^, k?, k®) 
approaches some scalar multiple of (o? + a2, 0, 0, a). Since k? = 
k? = 0, these members of the k congruence correspond to unstable 
circular orbits for photons parallel to the xy-plane formed by the 
intersection of various hyperboloids with the oblate spheroid defined 


by the equation p = m + ,/m? — a?. 


a 


w 


(4 


— 


Further physical consequences of this surface and other surfaces near a 
black hole are discussed in almost any current text on general relativity. 


10 


PETROV'S CANONICAL FORMS FOR THE 
WEYL TENSOR AND ANOTHER APPROACH 
TO THE KERR METRIC 


10.1 ^ Petrov's Canonical Forms for the Weyl Tensor 


In seeking out solutions for Einstein's field equations for empty space, one 
can save considerable labor by imposing symmetry conditions on the 
Riemann tensor. These conditions are particularly useful if they also 
guarantee that the Ricci tensor is zero. Aleksei Zinoveivich Petrov (1954, 
1969) devised a classification scheme that divides all Riemann tensors into 
six classes. 

Users of the formalism of spinors have successfully exploited the use of 
these symmetry classes to construct a substantial number of solutions to 
Einstein's field equations. Much of this work was summarized in Exact 
Solutions of Einstein's Field Equations by Kramer et al. (1980). However, 
these same symmetries can be exploited with the formalism of Clifford 
algebra. 

At the end of this section, it is shown how symmetries imposed by the 
Petrov classification can expedite calculation of the Schwarzschild metric in 
a rather trivial manner. Section 10.3 in this chapter is devoted to showing 
how the Petrov symmetries, along with the Bianchi identities, can be used 
in a somewhat different manner to obtain the Kerr metric. 

This section is devoted to determining the canonical representation 
for each of the six Petrov classes. The next section is devoted to finding 
the principal null directions for each of these canonical representations. 
In the third and last section of this chapter, we adjust one of Petrov's 
canonical forms so that one of the principal null directions conforms 
with the null vector that appears in the line element of the Kerr metric. 
The resulting symmetries imposed on the Riemann curvature 2-forms 
are then exploited to obtain the Kerr metric in a straightforward 
manner. 

The Petrov classification scheme for the Riemann tensor R;,,,, may also 
be considered a classification scheme for the Weyl or conformal tensor Ci pm 
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where C;,,,, is defined by the equation: 


ijkm 


Ciim = Ris — RiteGmy + Rygpdmy + 3RGitGmyj- (10.1) 


The square brackets in Eq. (10.1) indicate that an antisymmetry is 
imposed on the indices m and k. In particular: 
Ciim = Ry jem — 3(Rie Gy = RimGx;) 
+ (Rx Gmi — Ria) + ERG Imj ^ Gm GIui)- 
We note that when the Ricci tensor R,; is zero, C, xm = Rijm. The reader 
should check the claim that 


Ca = 0. (10.2) 


Because of Eq. (10.2), the Weyl tensor is said to be the “trace free part” of 
the Riemann tensor. The reader should also check that the Weyl tensor has 
the same symmetries as the Riemann tensor, that is 

C, jem = Cp jim (10.3) 
and 


C, Um] ix 0. (10.4) 


To analyze the Petrov classes, it is useful to use an orthonormal 
noncoordinate frame; that is 


PP t+ Pedy = 2n. (10.5) 


The Weyl tensor will be treated as a matrix. That is, if a = Ja";, and 
b = 3b"3,, then b = Ca if 
bi = C#3?’ E C4?! + CU zat? + C9 gat? 4 C927? + C, a??. 


(10.6) 
Writing C as a matrix, we have 


Cua Ces: Cc. CP Ca Cao 
C31 C. Cs Cu C s Can 
ó C5, Cre Cu, Clio C5 C 10.7) 
T Jet CM, c9, c9. c. c | O 
Co. Cos c?9 C724 C9 62955 


30 30 30 30 30 30 
C"35 C'"34, OM 2 C Pio C39 C30 


252 CLIFFORD ALGEBRA 


For a Weyl matrix there is a substantial amount of symmetry among 


the components. For example, since C'; — 0, it follows that 


C, = C = CP, + C712 + C13 = 0 
or 
Cm + eg t C313, = 0. 
Similarly 
C^ = C7929 + C171. + C733 = 0, 
C = C. + CH. + C**23 = 0, 


(emg = on + Gus + (s = 0. 


Adding Eqs. (10.8)-(10.10), we have 


2(C!9,5 + ear + C750) s (C?;, * Qu + C15) =0 


Combining this equation with Eq. (10.11), it is clear that 
Cri + Ca + Co = 0. 
Subtracting Eq. (10.12) from Eq. (10.8) gives us 
C16 = C793. 
Similarly 
CR = CU 


and 
30 "E 12 
C?) = Cl*is. 


(10.8) 


(10.9) 
(10.10) 


(10.11) 


(10.12) 


(10.13) 


(10.14) 


(10.15) 


Thus we see that the diagonal entries in the 3 x 3 block in the lower 
right-hand corner of the matrix in Eq. (10.7) are identical to the correspond- 
ing diagonal entries in the upper left-hand corner of matrix C. Also from 
either Eq. (10.10) or Eq. (10.12), we see that the traces of these 3 x 3 blocks 
are zero. In actuality not only are the diagonal entries of these 3 x 3 blocks 
identical but the entire 3 x 3 blocks are identical to one another and 


symmetric. 


To see that the two blocks are symmetric, we note for example that 


12 d S — 31 
C 3177 Ci23i = C3112 = C 12 


and 


20 = = — £730 
C 307 C2030 x — C3020 =C 20° 
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The identity of the two blocks follows from the fact that C7, = 0 for 
j,k = 1,2, or 3 and j #k. For example 


C2; = 0 = PLUME + Cr, 
or 


20 _ 12 
C^3o = C ^3. 


Now consider the remaining 3 x 3 blocks. As will be shown, these two 
blocks are also traceless and symmetric. They are not equal to one another 
but one is the additive inverse of the other. 

To show that the trace of the upper right-hand block is zero, we note that 


3Ciz23110 = 0 = C2310 + C3120 + C1230 = C7719 + C129 + CY. 


The fact that the 3 x 3 block in the upper right-hand corner is symmetric 
follows from the fact that C*, = 0 for k = 1,2, and 3. For example 


Cto = 0 = CU, + CM 93 = - CU 99 Cae 
or 


12 _. (731 
C ^30 = C^ go. 


To show that, except for a sign reversal, the 3 x 3 block in the upper 
right-hand corner is equal to the 3 x 3 block in the lower left-hand corner, 
we observe that for i, j, k = 1,2, or 3 


jk "e" P iO 
C i0 T Cirio Tet Cio jn --—C ik 


To summarize 


M N 
c=| | (10.16) 
-N M 


where both M and N are traceless and symmetric 3 x 3 matrices. One is 
tempted to say that this form of C represents the most general form of a 
Weyl matrix or a matrix representing the curvature for empty space using 
an orthonormal noncoordinate basis. Actually this is almost but not quite 
true. The Weyl matrix must also be consistent with the Bianchi identities. 
(See Appendix A.3.) 

Nonetheless if we replace the Weyl matrix C on the left-hand side of Eq. 
(10.16) by the corresponding matrix for the Riemann tensor, the resulting 
matrix equation may be regarded as the complete set of Einstein's field 
equations for empty space. In this context it is assumed that the Riemann 
tensor matrix used to replace the Weyl tensor matrix is derived from 
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some generalized metric. This guarantees that the Bianchi identities are 
satisfied. 
At the end of this section it is shown how this form of Einstein's field 
equations can be used to expedite the solution of the Schwarschild metric. 
To study the Petrov classes, it is useful to modify Eq. (10.6). In particular 
it 1s useful to replace the equation 


a? b23 
a?! p 
M N a? p? 
p zi XLI (10.17) 
a?! p29 
q?9 p?? 
by the equation 
a23 
PEE 
if I -ill M NWI d I in| a? 
Js dos wha ies d a1? 
q?° 
q3° 
p23 
p?! 
1 -il] b" 
ZI A pio 
p?9 
p?9 
or 
a? Li ia? p23 ipi? 
a! — jg?? p?! — jp?9 
M iN 07| at? — ia?? p — ib?? 
| 0 iam a!? —ig? | | bt — ip? 
20 _ ig?! p29 As ib?! 
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This last equation may be decomposed into two equations: 


a?? = ial? p?? zn ip! 

[M +iN]} à?! — ia? |= | p?! — ip29 (10.18) 
at? t ia?9 bi? us ib?? 
a?? + ial? p23 $ ip!? 

[M — iN] a? + ia?9 | =| b?! + ib?° (10.19) 
a!? + jq? p!2 + jp30 


Equation (10.18) may be considered the result of applying the projection 
operator 4(1 + iJ) to Eq. (10.17). Similarly Eq. (10.19) is the result of 
applying the projection operator 3(I — iJ) to the same equation. It is the 
matrix M + iN = P that is known as the Petrov matrix. 

One should ask whether or not it is possible to simplify the Petrov 
matrix by a Lorentz transformation. Ideally one would like to find a 
transformation which diagonalizes C. Such a transformation would also 
diagonalize P. In general, this is not possible. However, one can generally 
find a Lorentz transformation which will make most of the components of 
P equal to zero. 

To see how the Petrov matrix behaves under various Lorentz trans- 
formations, let us see first how it behaves under a rotation in the xy-plane. 
We note that 


; ; 0 0 0 0 
923 _ i919) = | I cos — + $1? sin 5) $23 — jg10 (: cos — — $12 sin 
($ $^) ( 3t? 5 (9 ij) 2,29 5 


0 0 0 0 
=[Jcos—+ ?!?sin s) Ici "(i cos — — $1? sin s) 
( at? 5 ( 0$ 374 5 


0 0 0 0 
-(I-iJ P cos — — $1? sin 30 cos — — $!? sin — 
( )? 3-4 7 3; 7 > 


= (I + iJ)$?*(I cos 0 — 9! sin 0) 
= (I + iJ($? cos 0 + $?! sin 0). 





Thus 
$23 — j$19 = ($23 — i919) cos 0 + ($3! — if?) sin 0. 
Similarly 
$n i920 = (923 if!) sin 0 + (P " if?°) cos 6, 
while 
$12 = 12 and $30 = 730. 
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The coefficients a* transform in the same manner as the Clifford 
numbers 7^. Thus 


a = R,à, (10.20) 
where 
a? 2 iq! 
a =| a?! — ia?’ (10.21) 
at? — iq?” 
and 


cos? sinô 0 
R,=| —sin@ cos@ 0j. (10.22) 
0 0 1 


With this notation, Eq. (10.18) becomes Pa =b which means that 
(R,PR; )R, = R,b or (R,PR;')a = b. Thus we have 


P' = R,PR>}. (10.23) 


In a similar fashion, for a boost in the z-direction 


B = I cosh — 93° sinh £ 
and 
P' = B,PB;! (10.24) 
where 


cosh@  -—isinhó 0 
B,=|isinh@  coshó Of}. (10.25) 
0 0 1 


More generally, for any Lorentz transformation P' = LPL ! where L is a 
member of the group SO(3, C)—that is the group of 3 x 3 complex matrices 
with the property that Le SO(3, C) = L^! = L' (the transpose of L) and 
det L — 1. This group is a representation of the proper Lorentz group but 
not a faithful representation. In the group SO(3, C), the transformation which 
reverses the orientation of all four Dirac matrices (y* — — y^) is represented 
by the identity matrix. The group SO(3, C) is isomorphic to the group of 
proper orthochronous Lorentz transformations. (Orthochronous Lorentz 
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transformations are those which do not reverse the sign of the time 
coordinate.) 

Now that we have developed some machinery, let us turn to the task of 
finding canonical forms for the Petrov matrix P. We note that PT = P. Thus 
when P is real, one can use three orthonormal eigenvectors of P to construct 
a member of SO(3, R) to diagonalize P. 

Some aspects of this situation remain the same when P is complex. For 
one thing it is not difficult to show that eigenvectors that correspond to 
distinct eigenvalues are orthogonal. In particular, if P?, = 4,0, and Pb, = 
A305, then 





AID, = BIPD, = VIP", = (PV,) 9, = 4,010. 


Thus (A, — 4,)?19, = 0 or 919, = 0 if A, # Ay. 

What is different for a complex P is that an eigenvector ? may be a null 
vector; that is, it is possible that 9T? = 0. Such an eigenvector cannot be 
used to construct an orthogonal matrix which might diagonalize P. When 
P does have such a null eigenvector, P cannot be diagonalized by a Lorentz 
transformation. 

We will now see what can be done. Let us first suppose that P has at 
least one non-null eigenvector. It is possible to multiply such an eigenvector 
by a complex scalar so that the resulting vector ? has the property that 


B'S = 1. Any such vector ? must have the form 


a t ib 
-|c-cid 
et+if. 


Since ?'? = 1 > 0, it is not possible to have a = c =e = 0. Clearly there 
exists a real orthogonal matrix R representing a spatial rotation such that 


a a’ 
Ric} =| 0 where a’ > 0. 
0 


e 


Applying the same rotation matrix to the vector ?, we get 


a + ib a’ + ib’ 
BW = R? = R| c+id | = id' 
e tif if' 


Since the rotation matrix preserves the product 279, it is clear that 
(a' + ib’)? — (d? — (f = 1 and thus b’ = 0. 
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An additional rotation in the yz-plane gives us a vector 0” with the form 


L 


a 
ə?” =| id" where a’ >0 and (a)?-— (d"))- 1. 
0 
From the form of the boost matrix of Eq. (10.25), it is clear that we can 
apply a boost operator in the z-direction with a’ = cosh $ and d" = —sinh @ 
to obtain 
1 a 
0| = B,| id" 
0 0 


Since P is symmetric, we can write 


Z3 Z5 Zg 


However, if the rotations and the boost discussed above have been applied 
to P, we have 


1 
P|0|-2A0 
0 
and thus 
A, 0 0 


P= 0 Z4 Z5 


0 Z; Z6 


If P has a second non-null eigenvector w which has been normalized so 
that WTW = 1, then w can be represented in the form 


0 
w=|a+ib{ where (a)? +(c)? » 0. 
€ t id 


Applying a suitable rotation in the yz-plane and a boost in the x-direction 
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will transform the vector w into the form 


0 
1 
0 


When the same Lorentz transformations are applied to P, P must have the 
form 


AS 0 0 
P = 0 Ay 0 
0 0 z 


In this situation, it is clear that P has a third non-null eigenvector 


0 
0 


with eigenvalue zę. Thus we see that if P has at least two non-null 
eigenvectors, it also has a third non-null eigenvector and one can find a 
noncoordinate orthonormal frame such that 


A, 0 0 
P=|0 4, O | wheres, +4,4+/,=0. 
0 0 A 


If none of the eigenvalues are equal, the class of matrices that has this 
canonical form is known as Petrov’s class I. 

If two of the eigenvalues are equal but not zero, the class is known as 
class D. In that case 


—24 0 0 
P=| 0 A40 
0 04 


If all of the eigenvalues are equal, they must be zero. In that case we 
have a flat space and the class is known as class O. 

In Petrov's original classication, class D was considered to be a not very 
special case of class IL. However, the study of principal null directions has 
shown that it is useful to treat D as a distinct class. 
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Now let us investigate what was originally labeled Petrov's class II. 
Members of this class have one nonnull eigenvector but no more. As before 
there exists a Lorentz transformation that will give the normalized nonnull 
eigenvector the form 


When this same Lorentz transformation is applied to the matrix P, it must 
assume the form 


A, 0 0 
P=|0 Zi Zik (10.26) 
0 z, 23 


The 2 x 2 block in the lower right-hand corner must have a null eigenvector. 
The corresponding eigenvector for P must have the form 


0 
+2 


iz 


No generality is lost if we choose the plus sign. We should note that if such 
a vector with a minus sign is subjected to a 180? rotation in the xy-plane, 
one will obtain the version with the plus sign. This same rotation will change 
the sign of z, in Eq. (10.26) but it will otherwise leave the form of P 
unchanged. We now have 


10 010 0 
O0 z, z;|l| 1 | =A,] 1 
0 z 234Li i 
An immediate consequence of this relation is z4 + iz; =A, and z; + iz; = i45. 
Using these equations to eliminate z, and z}, we have 
A 0 0 
P = 0 Ag =; iz, Z5 
0 Z5 Az + iz 
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Since the trace of P is zero, it follows that 


297 0 0 0 00 
P=| 0 A O|ez|0 -i 1 
0 O02 0 1 i 


Applying a rotation in the yz-plane gives us P’ = R,PRT where 


1 0 0 
R,-|0 cos0 sin 


0 —sin@ cos 


Carrying out the matrix multiplication, we find 


-24 0 0 0 00 
P=| 0 A4 0|«eze^|0 —i 1 
-L0 04 0: Y 


If we now apply a boost in the x-direction, we have P" = B, P'BT where 


1 0 0 
B,=|0 coshó  —isinh |. 
O isinhó  coshéó 


Completing this computation, we have 


-24 0 0 0 00 
P’=| 0 A 0|vz,e72*0 —i 1 
0-04 0 1 i 


If z, 2 0, P" would be a representative of class D. However, by a 
judicious choice of 0 and $, we can equate z, exp 2(i0 + $) to any nonzero 
scalar. If we choose that scalar to be i, we have 

—24 0 0 
P= 0 A+1 i 
0 i A—1 


If å # 0, P is considered to be a representative of class II. If 4 = 0, P is said 
to be a representative of class N. 


262 CLIFFORD ALGEBRA 


Now consider the last Petrov class (class III). If P had more than one 
eigenvalue, it would have to have at least two eigenvectors. In classes I and 
D, we covered the case for which there are two nonnull eigenvectors. In 
classes II and N, we covered the case for which one eigenvector is null and 
the other is not null. This leaves open the possibility of two null eigenvectors. 
Ifthe two null eigenvectors had distinct eigenvalues, the two null eigenvectors 
would be orthogonal. However, that is not possible in a 3-dimensional space. 
(See Prob. 10.1.) This means that all three eigenvalues must be the same. In 
turn, this implies that all three eigenvalues must be zero since the trace of 
P is zero. We will see that if all three eigenvalues are zero and there is no 
nonnull eigenvector then there must be only one eigenvector which must be 
null. 

As in the cases previously discussed, one can adjust the form of the null 
eigenvector by a Lorentz transformation. Since the corresponding form of 
the Petrov matrix P must be compatible, we have 


0 Zi Z 234] 0 0 
P| 1 |=] z3 z4 25 |} 1 |=} 0 
i Z4 Zs Zgl|Ll 0 


From the three component equations, one can eliminate z3, Z4, and zę. We 
then have 


Z, 24 ë iz 
P=] 2, —-1i2Z5 zs 


1Z, Z4 iZ; 


Since the trace of P is zero, z, = 0. We now have P = z,G — iz,G? where 
G=|1 0 0 and G? — 0. 

If we now apply the same rotation and the same boost to our class III Petrov 

matrix that we did to our class II Petrov matrix, we find 


P-ze9*9G — iz, e?09* )g?, 


If z,=0, P is a representative of class N. Otherwise, we can choose 
z,exp(i0 + @) to be any nonzero scalar. If we let z, exp(i0 + ¢) = 1 and 


PETROV'S CANONICAL FORMS 263 


replace —iz, exp 2(10 + $) by z, we have 


0 1 i 
P= Z iz 
i iz —z 


We can now summarize the canonical representations of the Petrov matrices. 
This is done in Table 10.1. 

In the next section, it will be shown how the symmetry of principal null 
directions is related to the six Petrov classes. Investigators using the 
Newmann- Penrose formalism of spinors have been able to exploit these 
symmetries to obtain exact solutions to Einstein's field equations. Much of 
this work is summarized in Exact Solutions of Einstein's Field Equations by 
Kramer, Stephani, MacCallum, and Herlt (1980). 

There is no reason to believe that similar results could not be achieved 


Table 10.1 
Class O 000 
P=|0 0 0 
00 0 
Class I 4, 0 0 


= A, +4, +4; =0 and 
PIA ST arit e 


0 0 A; 
Class D —24 0 0 
P-| 0 AO A#0 
0 04 
Class II —24 0 0 
P= A+] i A#0 


Class N 


0 
0 
0 

Class III | 1 d 
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using other formalisms such as Clifford algebra. To demonstrate this, let us 
consider an almost trivial example. 

If we desire a spherically symmetric solution, the principal null directions 
would have to be radial. In the next section, it will be shown that most of 
the Petrov classes do not have the high degree of symmetry required for a 
spherically symmetric metric. The canonical form of a class D Petrov matrix 
has two principal directions (fọ + },). These two principal null directions 
can be interpreted as geodesics for incoming and outgoing photons if ?, is 
interpreted as a unit vector in the radial direction. 

The canonical form of the Petrov matrix for class D is 


—24 0 0 
P-| 0 40 
0 0 4 
If å = « + if, then 
—22 0 0 —2p 0 0 
M=| 0 «0 and N=; 0 pf 0 
0 Oa 0 0g 


Matching this up with Eqs. (10.7) and (10.16), we can write down the 
corresponding curvature 2-forms. We note that 


az 4km 
Ri; = 2Rijem? 


J 


and thus 
Ry, = RÌ = —2aj?? — 289°, (10.27) 
Ry, = R?! = ap! + py, (10.28) 
Ry, = R? = aft? + pp, (10.29) 
Ry = R! = —2B§?? + 200", (10.30) 
Rro = R” = ppt uy, (10.31) 
Ryo = —R® = Bp? — af, (10.32) 


If we hypothesize the same form for the Schwarzschild line element as 
was done in Chapter 6, we have 


(ds)? = c?f(r)(dt? — h(r)(dry — r?(d0)? — r? sin? (d$. (10.33) 
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As noted above, we are required to identify ?, as the unit vector in the 
radial direction. Making this identification and using Eq. (10.33), we can 
now write 


r = (SE Po, 
y, = ADP, 
=rĵz, 
Yo =r sin 05. 
The coefficients of j, in these last four equations can be used to transform 


the curvature 2-forms from a noncoordinate frame to a coordinate frame. 
From Eqs. (10.27), (10.28), and (10.31) we have 


Roy = r? sin OR, = r° sin 6(—2a?? — 25!?), (10.34) 
Ry, = rh* sin OR, = rh? sin 0(xj?! + $??), (10.35) 
= rf Rg = orf 1B)?! — 00f?9). (10.36) 


From Eqs. (6.9)-(6.12), 








cf' ^10 
t ASE’? 
T, — 0, 
1 
T, = —-— 912. 
AES ape 
p sing cos 0 
2h? 2 
Since 
ð 0 
Ry, = z m ^i aD- DE + RE, 
we find that 
Pho = sin |J h, (10.37) 


Ry, = th- Sh’ sin OF", (10.38) 
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€ f aao 


Ry, = , 10.39 





Matching these equations up with Eqs. (10.34)-(10.36), we note that f = 0 
and 


E ee (10.40) 
h 

h- GP! = 2arht, (10.41) 

f' = —2arhf. (10.42) 


Equations (10.40)-(10.42) can easily be solved for h, æ, and f with the 
standard result. (See Problem 10.2.) In principle one should check that this 
solution obtained from three of the six curvature 2-forms is consistent with 
the equations corresponding to the remaining curvature 2-forms but this is 
a straightforward task. 

Perbaps we have applied a sledge hammer to crush a gnat. However, it 
is plausible that this same sledge hammer can be used to solve more difficult 
problems. As noted before, when the Weyl tensor is replaced by the Riemann 
tensor, Eq. (10.16) becomes Einstein's field equations for empty space. Using 
this approach we have avoided the task of extracting formulas for the 
components of the Ricci tensor from the curvature 2-forms. 

In Section 10.3, we will show another way that the Petrov classification 
scheme can be used to obtain the Kerr metric. 


Problem 10.1. Suppose v — (0, 1, i). Show that any 3-dimensional, complex- 
valued null vector which is orthogonal to v must be a scalar multiple 
of v. 


Problem 10.2. Eliminate the function « from Eqs. (10.40) and (10.41) to 
obtain a differential equation for h. Solve the resulting differential equation 
for h and use the result to obtain the formula for g. Finally solve Eq. (10.42) 
for f using the usual long-range boundary conditions. Then check your 
answer with the result in Chapter 6. 


10.2 Principal Null Directions 


A null vector n = n,y/ is said to have a principal null direction if it satisfies 
the equation 


Ty Caii qnin! = 0. (10.43) 


From the theory of quartic equations, it is known that Eq. (10.43) has 
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four solutions (unless C;;,, = 0). However, it is possible that some of these 
directions may coincide. Furthermore, the coincidence pattern of a Weyl 
tensor is a consequence of its Petrov class. 

It is also known if the direction of a principal null vector coincides with 
another then it satisfies a stronger relation than that represented by Eq. 
(10.43). In particular 


C, t "nunin! = 0. (10.44) 

when n represents two or more coincident directions. Furthermore 
Crijtpnan = 0 (10.45) 

when a represents three or more coincident directions. And finally 
Cn! = 0 (10.46) 


when n represents the direction of all four principal null directions. 

Ann Stehney (1976) demonstrated that the coincidence pattern is related 
to an algebraic relation between the 2-dimensional planes that pass through 
these null vectors. 

Of particular importance are null planes. A 2-dimensional plane is said 
to be a null plane if it contains one and only one null vector. Such a plane 
may be characterized by the fact that it is tangent to the light cone. Since 
a plane can be represented by the vectors that span it, a null plane N that 
contains the null vector n can be represented in the form 


N-n^u-nu (10.47) 


where u is a 1-vector orthogonal to n. 

Actually u can be replaced by u + an without changing N. Thus u may 
be adjusted so that it has no time component. The most general nuil plane 
N containing the null vector n may be presented in the form 


N =n ^ ((nyxP)y — (njyP)x) 


where x and y are arbitrary 1-vectors. 
If we define 


Ci(N) = 3Cig, N a 
and 


C(N) ay ICN) = MC N", 
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then 
CN) = ACi jem (OS) = 1C, ijkm ny yT — yixT) 
= 3(Cijemt™ ny — C; n* Am) Xt yt 
= Cis ipn" xh. 
Furthermore 


(NCN, Do = GN) pg? ta? HC, iml N)" ")o 
= (Na) nq G9" = gg?!) Ci, (N.)" 
= — AD! CO)". 


If Ny = n ^ ((n,x$)y; — (n,y$)x;), then 


(N,C(N,))o = HN)” Ci (SS) 
= ny Cjgp pnn" xR yrx$ y). 
Since N, and N, are arbitrary null planes containing n, it follows that x1, 
y™, x4, and y4 are arbitrary and thus 
(NCN D)o = — (02) 9 Cia UN)" = 0 
«ni Capt n n" = 0. 
Thus we see that the condition that n is a principal null vector is equivalent 
to the condition that (N;)/C,,,(N,)"" = 0 where N, and N, are arbitrary 
null planes containing n. Actually the condition that (N;) IC iem N) = 0 
can be replaced by the condition that NYC, jem N" = N;,CU,,, N*" = 0 for all 
null planes N containing n. (See Problem 10.3.) Thus to determine our desired 
null planes we need to solve two equations: 
N, C” pn N" mE 0, (10.48) 
and 
NN" =0. (10.49) 
Solutions to these two equations may be constructed from solutions 


of some very closely related equations involving the Petrov matrix P. 
Namely 


5-0 (10.50) 
and 


$T) =0 (10.51) 
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where 
W = P) =(M iN). (10.52) 


It is understood here that ? has complex components and if 


then ÐT = (vt, v?, v?). To construct a solution for N from ?, we note that if 
?$ and W satisfy Eq. (10.52) then 


M NT eis i8. 
| |: d - 5 dl (10.53) 
-N Mic ie” w 


where 0 is arbitrary. If ò = p + iĝ and w =F + i$ where P, g, F, and S are 
real, then we can take the real part of Eq. (10.53) to get 


| M Al | | 
—N MJL-? sin @ — 4 cos 0 —?FP sin 0 —s3cos0]| 
If 70 = 0, then p'p — G'q + 2ip'j = 0. Thus |p| = [j| and p79 = 0. The 


reader is now in a position to check the statement that if satisfies Eqs. 
(10.50)-(10.52) then Eqs. (10.48), (10.49) are satisfied by N if 





N = $33(p' cos 0 — q' sin 0) + ?4,(p? cos 0 — q? sin 0) 

+ $12(p? cos 0 — q? sin 0) — fiol p! sin 0 + q! cos 0) 
— faol p? sin 0 + q? cos 0) — f30(p* sin 0 + q? cos 0). — (10.54) 
To extract the null vector n from the null plane N, it is useful to review 
the comments following Eq. (10.47). Examining the coefficients of 719, P20, 
and fzo in Eq. (10.54), we can read off the components of u. From Eq. (10.47), 

it is clear that 
n= —Nu= jo + fi(a?p? — dp?) + PAPP — q' p?) + js(a! pP? — q? p^). 

(10.55) 


We now have the machinery to compute the principal null directions 
for the canonical representations of each of the Petrov classes. For class I 
(see Table 10.1): 


Av! 
P? = Anu? 


3 
A3v 
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Thus ? must satisfy the equations 


VT Pj = (0312, + (024, + (2A; = 0 
and 
pi = (o1? + (c2)? + (v>)? -. 
From these last two equations, we see that the vector ((v')*, (v2)?, (v3)*) is 


orthogonal to both (4,, 45, A44) and (1, 1, 1) and is therefore a scalar multiple 
of (4,, 22, 43) x (1, 1, 1). Thus 


(03)! = r? e?(, — 4) = (a! + ib), (10.56) 
(v?)? Sr et (A3 —A)- (a? + ib^y, (10.57) 
(03)? = r? ei?** (4, — 4,) = (a? + ib3y?. (10.58) 


It is understood that r is adjusted so that aTà = bTb — 1. From Eqs. (10.56), 
(10.57), and (10.58), we can extract four distinct principal null directions by 
letting 


v1 = (a! + ib’, a? + ib?, a? + ib?), (10.59) 
31 = (a! + ib!, a? + ib?, — a? — ib’), (10.60) 
V1 = (a! + ib’, — a? — ib’, a? + ib?), (10.61) 
I =(—a! — ib!, a? + ib?, a? + ib?). (10.62) 


For the first case 
n, = jg + 9,(b2a3 — b?a?) + $,(b3a! — bta?) + $4(bla? — b?a!). (10.63) 


For the canonical representation of Petrov's class I, there is a substantial 
degree of symmetry. To obtain n, from n,, we change the signs of à? and 
b?. This has the effect of reversing the signs for the components of }, and 
f, in n. This corresponds to a 180° rotation in the 1-2 plane. Similarly n, 
can be obtained from n, by a 180° rotation in the 1-3 plane. Finally n4 can 
be obtained from n, by a 180? rotation in the 2-3 plane. The three space 
angles between these directions would be preserved under spatial rotations 
but not under boosts. 

Now let us consider class D. In some sense class D is a special case of 
class I. For class D, 4; = 44 = À and 4, = —24. For Eq. (10.56), at = b! = 0. 
From Eqs. (10.59)-(10.62), it is clear that there are two principal null 
directions: ĵo + ?, and fọ — ?,. A further examination of the same equations 
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reveals that one of these directions results from the coincidence of n, and 
n4, While the other results from the coincidence of n, and n3. 

For class II, there is one doublet principal null direction and two singlet 
principal null directions. For the canonical representation of class II, it is 
useful to let 4 = (a + ib}. Calculation shows that the doublet direction is 
fo — $, and the two singlet directions are 








3(a? + b?) —1, 2, /3a j 23b M 
dus 


I EPI Au hyd ^ dap bel 


and 


38 +b*%)—-1, | 2./3a TE 2. /3b 
tbe! Satay? 345p 








Yo 


We note that one singlet direction can be obtained from the other by a 180° 
rotation in the 2-3 plane. 

Class N is obtained from class II by setting 4 = a = b = 0. In that case 
there is a single null direction which is fọ — f4. 

For class IIT, there is one triplet principal null direction. For the 
canonical form that we have chosen, this principal null direction is fy — f4. 
If A = a + ib, the singlet direction is 


2o 4-g-b, da | 4b 
Yo + ih +7 a tzm 


° 4+a +b +a? +b zy 


4ra rb 


Problem 10.3. Show that the condition that (N;)7 C, (N,)'" = 0 for arbi- 
trary null planes N, and N, passing through the null vector n is equivalent 
to the condition that NUC,;,,, N*" = 0 for any null plane N passing through 
n. Hint: suppose N, and N, represent distinct null planes passing through n. 
Show that any null plane N passing through n may be represented in the 
form N = aN, + bN, where a and b are real scalars. One may also use the 
fact that (N3) C, jiem (N) = (NDË Cis NS)". 


Problem 10.4. In Section 5.5, it was pointed out that a vector that is parallel 
transported around a small closed loop undergoes an infinitesimal rotation. 
The rotation operator that achieves this is I + (Ax (Ay) 4; where the loop 
lies in the plane x ^ y. A vector that undergoes a rotation does not change 
its length. However, a null vector with its zero length may be rotated in such 
a fashion that its direction remains unchanged but it is multiplied by some 
scalar. The condition that a null vector # does not change direction when it 
is parallel transported about the loop Ax ^ Ay is [3(Ax' Ay?) 8$, n] = an 
or 


[3(Ax! Ayi) Rj n] ^ n — 0. (10.64) 
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1. Show Eq. (10.64) is equivalent to Eq. (10.43) if the loop lies in a null 
plane N containing the principal null vector n. 

2. Show that Eq. (10.44) is equivalent to the condition that if the null 
vector n is parallel transported about a small closed loop in any null 
plane N containing n, then n will be completely unchanged. 

3. Show that the condition that n corresponds to the coincidence of two 
principal null directions is equivalent to the condition that C(N) = aN 
where N is an arbitrary null plane containing N. 

4. Show that the condition for a triple coincidence (Eq. (10.45)) is 
equivalent to the condition that C(N) — 0. 

5. Show that the condition that all four principal null directions coincide 
is equivalent to the condition that C(A) = 0 where A is an arbitrary 
(not necessarily null) plane containing the principal null vector n. 

6. Determine the consequence of parallel transport of various vectors 
in various planes when there is a triple or quadruple coincidence of the 
principal null directions. 


10.3 The Kerr Metric Revisited via its Petrov Matrix 


The Kerr metric was first computed by the use of spinors. An advantage of 
using spinors is that one can introduce a principal null direction into the 
computations in a natural way. However, one can also deal with principal 
null directions using Clifford algebra without difficulty. 

In this section, we will demonstrate how the Petrov classification scheme 
can be used to obtain the Kerr metric in a somewhat more straightforward 
manner than was achieved in Chapter 9. 

As in Chapter 9, we note that the Kerr metric is characterized by the 
equation: 


ds = y, dx* 
where 


y= Îr — mwywP,. (10.65) 


The y,’s form the basis of a coordinate frame while the fẹs with the 
underlined indices form the basis of an orthonormal noncoordinate frame. 
In particular 


^o ^ 


iP + Pufy = 2n 


li 


Es 
o 
o 

ll 

È 
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ll 
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ll 
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S 
E 
ll 


0 for j # k. 
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As in Chapter 9, w is a null vector. Since 


ka 


k k k " k 
w=wy = y = WZyy + mwiw, WEP, = WHY es 


it follows that w* = w*. Thus it is unnecessary to underline the indices for 
the components of the null vector w. 

The fact that one might find a solution to Einstein’s field equations with 
the form of the line element imposed by Eq. (10.65) is justified by the fact 
that Eddington’s form of the Schwarzschild metric has the same form. 

As in Chapter 9, the Fock-Ivanenko 2-vectors are 


p - m (wawy). (10.66) 
Also 
T PEE EE T, - r5 4 ET, (10.67) 
oxi Ox* 
where 
By, = Rasa" (10.68) 


In view of Eqs. (10.65)-(10.67), it is clear that 2, is a polynomial in 
the variable m with the powers of m ranging from one up to possibly four. 
In this section, we will deal with only the first-order term. 

In the subsequent computations, we will assume that the null vector 
w, 2 that appears in Eq. (10.65) is a principal null vector. As in Chapter 9, 
it is useful to introduce a 3-dimensional space vector À of unit length. In 
particular 


Wp? = Wolf? + 234, + $245 + $344]. (10.69) 


The canonical form for the class D Petrov matrix is 


—2v 0 0 
P=: O0 v 0]. (10.70) 
0 0 v 


In the previous section, it was shown that the principal null vectors for 
this canonical form are ĵ2 + ji. For our purpose, we need to obtain the 
form of the Petrov matrix when one of the principal null vectors is #2 + JEg 
Our first thought is to consider what happens to the Petrov matrix when 
the vector f+ is rotated into the vector 9*4. 

If double primes are used to designate the frame for the Petrov canonical 
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form and single primes are used to designate the rotated frame then 
$1 = $k), = BR‘, (10.71) 


where & is a rotation operator. There are many rotation operators that will 
achieve this but all will have the same effect on the Petrov matrix. One 
rotation that will achieve our purpose is the 180? rotation about the axis 
that is half-way between (1, 0, 0) and (4,, 45, 43). To write down our desired 
rotation operator, we need to normalize the vector representing the axis of 
rotation which is (1 + 4,, 45, 44). We see that (1 + 4,)? + (43? + (4? = 
(1 + 4? 1 — (44)? = 2(1 + 24). Therefore, our rotation operator is 


0 0 ; 
R = I cos + sin = [n'j22 23 4 y2H3L 4 4,3512] 
1 , 
Eee [( + 40923 + Ay) 3 + 439+]. 
2(1 +2) 
Using this rotation operator, 
$23 = RJR! 


423 _ 9 531. 4 512 
=e SEA + 4022 — 45? Asp] 


x [C + 40522 + A293! + 1,522] 
= —}23[ -I — 83, + p42 221, 





or 
$23 = 9234, 49314, + 9124. (10.72) 

Similarly 

" A , (A43)? ) ^ À343 

$31 — 9237, + 531) 1 + 2777 | 4522 A 10.73 

= == +? prt I (10.73) 
and 

" 1 À A y (A y 

12 23), ,921.72/3. emis : es 10.74 

ped x ea. "T+ MEUM 


From Eq. (10.18), we see that if P" is a Petrov matrix, then 


G22 — i42] [bz — ibt 
P”| 431 — i422 | =| 531 — i520 (10.75) 
G12 — 4430], | 512 — ipse 


However, the entities 4? — id*? transform in the same way as $Ë — if! = 
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(I+ i) ois, Thus if we multiply Eqs. (10.72-(10.74) by (I + iJ) and read off 
the appropriate coefficients, we see that 








423 — iie 423 — játt 
434 2,429 | = R| 434 — i420 (10.76) 
G — i430 412 — 1422 
where 
Ay Ay A3 
1 Ahs 
Re] ”? ERR ] -E4, (10.77) 
Aah As)? 
Ay 2^3 1 +4 ( 3) 
id 1+4 


From Eqs. (10.75) and (10.76), we have 


423 — 1430 bz — ibt 

P"R| 434 — i42? | = R| b31 — ib22 

42 — 1430 bizhi 

—2v 0 0 
P'=RP"'R=R'| 0 v OJR. 

0 0 v 


If we note that R ! is the transpose of R and carry out the matrix 
multiplication, we get 


[-B5435.-—3954.. 3h43 
P' =v —34544 1 — 3(3Y? — 34545 . (10.78) 
— 3434, cede 2-307 


After all that computation, it turns out that P’ is not the Petrov matrix 
for the Kerr metric. As was noted at the beginning of the calculation, for 
the canonical form of Petrov’s matrix, the principal null directions are 
92 + $1. The rotation has carried these two vectors into #2 + $*4,. Thus the 
spatial components of the two null vectors are still aligned. This alignment 
does exist for the highly symmetric Schwarzschild metric where the principal 
null directions correspond to that of ingoing and outgoing radial directions. 
However, this alignment does not exist for the Kerr metric. 

For that reason, we must consider an additional Lorentz transformation 
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X 


— 


q p 


Fig. 10.1. The 3-dimensional unit space vectors 4, p, 
with the relative orientation shown. 


and g are mutually perpendicular 


that will preserve the null vector 79 + 7*4, but not the null vector 72 — ?*A,. 
Two Lorentz transformations which will not work are a spatial rotation 
about the axis À or a pure boost in the direction of 4. A Lorentz 
transformation that does achieve the desired effect is a null rotation, namely 


g- exp( -$ pa get 9i). (10.79) 


where 


p= fip; po = 0, (f)? = —1, and f is orthogonal to Â = PA 


From Eq. (10.79), we can write 


2-Y- (+ ^J ace + ar. 


but 
[ape + PAY = 
so 
f= 1-5 p72 + à) 
or 
Ee £ [f£p, + $234, + =q: + 12294] (10.80) 


where (41, 42, d3) = (Di; Pa Ps) X (Ai, Az, 43). We see that À, p, and à are 
unit space vectors which are mutually perpendicular with the orientation 
shown in Fig. 10.1. 
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Using the null rotation operator £, we have 


(I i2 = 728 — i922 = LOZ iig 
LU +iW)jBL = +i) LIBL-'. 


It is useful to note that 
Q + iJ) p22 = (I + iJ)\GJ922) = (I + iJ)if23. 
In a similar fashion 


$ 


(4i) = (1+ nt -$ 23, + ips) $930; ip) $25 + ie) 


=(I + uj = © Pa + 22. + jag 
-(I-iJ£eg 
where 
Lf =] — Sgar, + pz, + p2z3) (10.81) 
and 


Zk = dy + iPr- (10.82) 
From virtually identical manipulations, we also have 
(cxi)*9!-(-riJ)£^!. 
Since 4(I + iJ)(1 + iJ) = (I + iJ), we now have 
cr iJ)9$34 =A HDLU + iJ)! 

=I + iDP + iJ) Z! 

= (I + iN LjIBLP |, 
With these results, we can write 


$ on ; , 
5 (8 n + Ph $n) 


(I + iJ)923 = (I+ in| = 
x zL + £ (9232, + $212, + 2] 
PN P aa à ; 
= (I + iJ)7# | m ($232, — $212, — $z) 


x | + $ dn +z, + zn 
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To complete this computation, it is useful to note that from Eq. (10.82), it 
follows that 


E 
y *t(-60-2Y( P. Pr + 2ipyqdy — IIe) 
2 3 Ra k Kk — dkik 
=1+0-1=0. 


Completing the calculation for (I + iJ)92 2? and then computing both 
(I + iJ)$3 and (I + iJ)512, we find that 


(I + iJ} (I + iJ) 923 
+ iJ)3i |= L| a+ iJ) 
iD I iJ) 
where 
2 2 2 
2 2 2 
D= ra P zaz, 1 +E ey -07 + © uns 


p? ¢? $? 
— $22 + Pu bz, + $5 1+ iy as 


Repeating previous arguments, we have P = L~'P’L where P' is defined in 
Eq. (10.78). Completing the computation we have 


1—3(,)  -—3vw —3v,v4 
P-y| —3o, 1—30,  —3ww, (10.83) 
— 3030, —3v3v, 1 — 3(v3)? 
where 
Dy = Ay + izr = Ay — OP, + ida. (10.84) 


From P we could determine the real and imaginary parts and then use 
those components to write out the curvature 2-forms. However, we can retain 
a greater simplicity if we apply the operator J + iJ to the curvature 2-forms. 
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For example 


Rio > &Rioif2 
2 —(R22,0 p10 4 R19, 23922) 
— (RS, j 520 + Rie, 31022) = (R1230) 530 + R, 12522) 
"us — (R16 RE R233 Jj? — (R$, T R19, J)j29 
— (R249 + R19,,)29. 
Since (I + iJ)J = (I + iJ)( —i), we can write 


(+ i)Ryo = — + i DER — iR*2 23) 92 
+ (R122 — iR19,,)f29 + (R24, — iR29,)j29]. 


A review of Section 10.1 shows that the coefficients of the 79's in this last 
equation are the components of the top row of the Petrov stie Thus 


(I +i) Bro = —vü + iJ) $29 — 3(v1)?) + $22(— 30,02) + $22(—30405)] 
=v + iJ) (fn, + 30,%,)]. 
Generalizing 


(I+ iJ) Rio = vI+ iJ)? + 30;0,)]. (10.85) 
A similar calculation shows that 
d+ W)Bo3 = iv + i), + 30, %)]. (10.86) 


At the beginning of this section, it was remarked that 22, is a polynomial 
in the constant m. Presumably this means that v is a polynomial in m, that is 


v= vm + v4m? + vym? + ee 


Designating the lowest order term of &, j by d j and replacing the function 
v, by 42, we have 


(1) 
( + iD) Ryo = mi + iS) IOC, + 300), (10.87) 
(1) 
U + iJ) yy = imp? (I + iJ)j + 30i), (10.88) 
a) 
(I iJ)A6, = —imp3(I + iJ)$£(2 + 3020,), (10.89) 


I 


(+i) By, = imp (I + iJ)j9 (92. + 329,), (10.90) 
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Except for a factor of 2, the operator (I 4- iJ) is a projection operator. 
Since the image of a projection operator does not uniquely determine its 
preimage there are many alternative representations of 2, to those that 
appear in Eqs. (10.87)-(10.90). However, for 2-vectors, the preimage of 
(I + iJ) is unique if we require that the preimage be in the 3-dimensional 
space spanned by $19, $29, and 732. 

The forms of the curvature tensor imposed by that of the Petrov matrices 
are not necessarily curvature tensors. A curvature 2-form must also satisfy 
the Bianchi identities. (See Appendix A.3.) In particular 


Viia = 0. (10.91) 
Using these equations it is possible to determine the formula for the 
complex vector v = j* v, and then the formula for the vector 4 — Peay and 


then the formula for the vector 4 = f*4,. 
To pursue this line of attack, we first note that 


ô 
Vidt, = ax! Ri, = T:R + RaT; 


and 


— i, = 0. (10.92) 


For the Kerr metric all time derivatives are zero. Thus Fq. (10.92) implies 
that 


Ad Rio = axe Rio, (10.93) 
and 
ô Ww ô (Q2 ô a) 


From Eqs. (10.87) and (10.93), we have 


0 ð 
Jd [u np + 31°40» | E [u°n,, + 31 v;v, ]. 
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Computing the derivatives, we have 


Qv, ð 
oe 49, oH ow, te D +3 joo, z 


gh i oxi 


du di àv; Qv, 
= yy? + 9 snk Uj pt T, + 3y?v an AVE (10.95) 


From Eq. (10.84), it is clear that v2v, = —1 and 








0 1 6 
2 — ý, =u? f= — ven „vi 
oat "e 7 ga 2 ae a 
_! Q pe 
2 dx? 





Therefore if we multiply both sides of Eq. (10.95) by v? and carry out the 
indicated sums, we have 


Ou 2 Ou ov Ou Ou ov 
2 én ae 2 "P _ oy? v, — 3u? 2 
roa Wd E cU 
Dividing by —3y and collecting like terms, we have 
m Qv, On Ov; 
BG gel a et 
or 
Ü- 0 
axi (u v) = ax Ak (K? vj j- 


This means that 
jk G 2 2 
PÉ — (^w) = d(u^v) = 
oxi E 


In turn this implies that at least locally there exists some scalar function y 
such that 


uv = dy. (10.96) 


The remaining Bianchi identity comes from Eqs. (10.94), (10.88), (10.89), 
and (10.90). From these equations, we have 


Q : 1 
zi [4201 + 3u’ viv] = 0 
" k k 
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or 


Ou Qu , Ovi ; OUk 
3, axe T 9u? oxi vive + 3,8 axi Uk + 3u?vi axi —. 


Multiplying this last equation by —(1/3y)v", and then collecting like terms 
gives us 


. On Ovi 
Quoi — + pw? — = 
E xi H Ox! 
or 
RUN =0. (10.97) 
Oxi 
This means that 
5(u?v) = dd(n) = V?g = 0. (10.98) 


As in Chapter 9, 


y2 = (#4) = (m Rd zum) 
óx* (0x1y?' (0x2)? (0x3)? 


For a radially symmetric solution, 


n= —— +B, (10.99) 





Ù, = ——-——— 
a Ox r? 
or 
~2 42 k 
dete (10.100) 
= r 
Since 
3 —444 3 -4 44 
3 [m A” a Hu ^A 
1-2» (%)’ = 6 È Oty = 4 7 
cg E EaR d 
we see that 
2 
w= e (10.101) 
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Combining Eqs. (10.100) and (10.101), we have 
Dy =A, = —. (10.102) 


Clearly the constant B in Eq. (10.99) has no impact on the metric. The 
solution of Eq. (10.102) corresponds to a version of the Schwarzschild metric 
known as the Eddington form of the Schwarzschild solution (A. S. Eddington 
1924). (See Problem 10.7.) 

For the Kerr metric, we clearly need a complex solution for v,. If 


n = —A*[x? + y? + (z + iay] ? = —A?*(p — io) !, — (10.103) 
then from Eq. (10.96) 
17(04, 05, 3) = A*(p — io) *(x, y, z + ia). (10.104) 


At this point you might infer that the sign conventions used for c and 
à in Eqs. (10.103) and (10.104) represent a deviation from the sign conven- 
tions used in Chapter 9. Actually that is not the case. The sign conventions 
used above are consistent with those used in Chapter 9. Furthermore they 
are necessary to obtain the same answer for å that was obtained in Chapter 9. 
Returning to our computation, we note that 


Mie 


(vj)? = 1 = u^ *A*(p — ic) °Lx? + y? + G + iay] 


il 


k=1 


= u * A*(p — ic) ^. 
Thus 
A = (p — io). (10.105) 


Using this last result, it is possible to eliminate the constant A from Eq. 
(10.104). We then have 


m x y z+ia 
ZI 2.35 "C a) 
z p—ic p—ic p -— ic 
-( px py 5:5) 
Pto pP Ho pte 
( ox oy | ocz- pa 
+1 * 5 
Pte pte p poi 














) (10.106) 


It is not difficult to extract 4 from ?. Since? = À — $p + id, we can take the 
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imaginary part of Eq. (10.106) to obtain 


z ox oy oz + pa 
= 3 10.107 
04 ( 24g?’ p? + 6?’ AM) ( ) 





Referring to Fig. 10.1 and using ordinary vector notation, we get 





Bx OG = GA xG—O>PxG= Op + GA. 


Combining this equation with Eqs. (10.106) and (10.107) gives us 





Á A a —ax 
op + $74 = (= r2 or p0). (10.108) 


Taking the real part of Eq. (10.106), we have 


3.48 px py  pz—oca 
À-dp- ; ; ; 10.109 
o (> to p +a? p. =) 





Adding Eq. (10.108) and (10.109) gives us 


E px-Tay py —ax pz—oa 
1+ oDi =| ————, a, ———— ]. 10.110 
E (8:3 pP +o? m) ( ) 


Using the fact that the length of À is 1, it is now possible to obtain À from 
Eq. (10.110). Carrying out the computation, one gets 


> px +ay py —ax pz —oa 
A = (A, 42, 43) = 3 j . 10.111 
n 2 3) (53 pta zc) ( ) 





Having obtained a formula for 4, what remains is the task of determining 
wo. Now that we have obtained an explicit form for v, it is possible to 
simplify Eq. (10.87). To do this, we first note that 


ô 1 
àx* p — io a 





[x? + y? + (z + ia] * 
= —[x? + y? + (z + ia)?] “iu, 
where u; = x, u, = y, and u, = z + ia. Furthermore, 


e? 


2 -in, 
Oxi 0x* p — nn 








+ 3[x? + y? + (z + ia)?]~2u,u, 
= (p — io) ?(n jk + 30,0). (10.112) 
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Using this result, along with Eq. (10.105), Eq. (10.87) becomes 


1 


(+i) = mA*U + iJ) SM, 
7o i ôxi üx* p — is ` 





(1) : : 
One can extract 2j, from this last equation by taking the real part of 
both sides. In this fashion, we get 








(1) e? p 
Bo = mAs © L 
2 ? Ox! ax" p? + o? 
e? 
— mA3 Jj 2 (10.113) 


axi Ox* p? + o? 


On the other hand 


where 


If we use the fact that 











Matching this with Eq. (10.113) gives us 


e? e? 
Tage Wo = -A aa 
Ox! Ox Ox! Ox* p* + o 


This implies that 





UU ls 


— 4- a constant. 
üxk Ox’ p? + a? 


However, if (wo)? = 0 at oo then the constant must be zero. Repeating the 
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same argument, we have 
(Wo)? = —4? eae 
pte 


The choice of a numerical value of A determines the physical interpretation 
of m. Following the standard convention used in Chapter 9, A = —1. 
Therefore we have 


p 
pco 





(Wo)? = 


Thus we see how the Petrov classification scheme and the symmetry of 
the principal null vectors can be exploited to obtain a metric. 


Problem 10.5. Show that the null rotation of Eq. (10.80) preserves the null 
vector #2 + 9*4, but not the null vector 72 — £A,. 


Problem 10.6. Show that the null rotation of Eq. (10.80) can be factored into 
a product of a boost and a spatial rotation. Hint: Z may be factored into 
the product BR or RB’. The boost operator is different in the two cases 
but in either case the axis of rotation is (q,, 42, q3). 


Problem 10.7. The line element for the usual Schwarzschild metric is 


-1 
(ds)? = (1 — 2) c*(dt)? — ( — =) (dr)? — r?[(d0)? + r? sin 0 (d$)?]. 
r r 
Suppose one substitutes 


ct = x? — 2m In ! r=f, 0 — 6, and $ = >. 








Si 
2m 


Show that the resulting line element is 


(ds)? = (dx°)? — (dr)? — P [(d8)? + ?? sin? 0(dd)?] — zal [dx? + dF]? 
r 





d d dz |? 
Ver 


r r r 


2 
= (dx°)? — (dx)? — (dy)? — (dz)? — 7” [a + 
r 
This is Eddington's form of the Schwarzshild metric (Eddington 1924). 
Problem 10.8. From Eq. (10.111), we know the forula for 4— pea, in 


Cartesian coordinates. Convert this to oblate spheroidal coordinates to show 
that this is the same result obtained in Chapter 9. 


11 


MATRIX REPRESENTATIONS AND 
CLASSIFICATIONS OF CLIFFORD 
ALGEBRAS 


11.1 Matrix Representations of Clifford Algebras 


In the first nine chapters, the intimate connection between differential 
geometry and Clifford algebra has been emphasized. However, some physi- 
cists have found applications for Clifford algebras which are virtually 
divorced from any underlying geometry. For example, in his book Lie 
Algebras in Particle Physics, Howard Georgi uses elements of a Clifford 
algebra to construct creation and annihilation operators (1982, pp. 209—219). 

In any unified field theory, the required symmetries are more apparent 
than any geometric manifold that some theoretical physicist may introduce 
to “explain” the symmetries. If Clifford algebras are found to be useful in 
such applications, it is important for the theoretician to be sensitive to the 
fact that different metric spaces may correspond to isomorphic Clifford 
algebras. 

For example, suppose we designate the signature of a pseudo-Euclidean 
space by (p, q). That is (},)° = (93)? =--- = (f,)? = I and 


pai)” = 0,34 = ++ 0, = 7L 


Suppose, in addition, that we designate the corresponding universal Clifford 
algebra by R,,. Then R, , is isomorphic to R, ,. To see this, we note that 
for R, y we can represent f, by o3, and f; by c,. In this representation R, o, 
treated as a vector space, is spanned by (1, f1, $2,912} = (1,03, 04,105]. On 
the other hand, for R,,, we can represent 7, by o, and 7, by ic,. In this 
representation, R, , is spanned by {I, $4, 2, 915] = (165,105, 01]. In these 
representations, R, 9 and A, , are both spanned by the same matrices and 
are thus isomorphic to one another. 

It is not too difficult to convince oneself that an arbitrary real 2 x 2 
matrix can be written as a linear combination of I, c}, io;, and o,. Thus 
both Rz o and RA, are isomorphic to the algebra of real 2 x 2 matrices, 
which is designated by R(2). 
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This does not mean that all Clifford algebras with the same dimension 
are isomorphic to one another. For example. A, ; is not isomorphic to R(2). 
One can represent Ro. by a subalgebra of the complex 2 x 2 matrices or 
by a subalgebra of the real 4 x 4 matrices. However, it is simpler to identify 
R,,; as the algebra of quaternions, that is (1, f4, 72,912} = (1, Lj, k}. The 
algebra of quaternions is usually designated by H. To summarize the last 
few paragraphs, we have 


Rz o = Ri, = R(2) (the real 2 x 2 matrices) (11.1) 
and 


Ro,2 =H (the quaternions). (11.2) 


It is not too difficult to construct similar formulas for all Clifford 
algebras. This will be done for complex Clifford algebras in this section and 
real Clifford algebras in the next section. Meanwhile, the construction of 
representations of universal Clifford algebras is greatly facilitated by a 
theorem introduced by Marcel Riesz (1993, pp. 10-12). His original theorem 
rephrased in current terminology states that all Clifford algebras are 
universal Clifford algebras. It is now recognized that his theorem is false. 
Correctly stated, the theorem should state that most Clifford algebras are 
universal. In the proof of his theorem, Marcel Riesz overlooked an excep- 
tional case. Nonetheless the method used by Marcel Riesz is useful to prove 
the correct version. 

Before proving the Riesz theorem, let us review the pertinent definitions. 
An n-dimensional orthonormal system of Dirac matrices f4, 75, ..., 9, has 
the following properties: 


(1) (9,)? =I for k = 1,2,..., p and ($,)? = —I for 
k=p+t+1,p+2,....ptq=n. 


(2) Pfr + $19; = 0 for k #j. 


The algebra generated by such a system of Dirac matrices is called a Clifford 
algebra. If the field of scalars is the field of real numbers, the algebra is said 
to be a real Clifford algebra. If the field of scalars is the field of complex 
numbers, the algebra is said to be a complex Clifford algebra. 

Besides properties (1) and (2), there is an additional property which 
occurs for most systems of Dirac matrices. This is: 


(3) by taking all possible products of the n Dirac matrices, one can form 
a set of 2" linearly independent matrices. (These products may be 
written in the form M,M,... M, where M, = $, or I.) 


If property (3) holds, the resulting Clifford algebra is said to be universal. 
The universal Clifford algebra corresponding to the signature (p, q) will be 
designated by R, (C, ,) if the field of scalars is real (complex). 
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To determine whether or not a system of Dirac matrices generates a 
universal Clifford algebra, the corrected version of Marcel Riesz's theorem 
is extremely useful. 


Theorem 11.1. An n-dimensional orthonormal system of Dirac matrices 
generates a vector space (and therefore an algebra) of dimension 2" unless 
the product 7,2, ... f, is a scalar multiple of I. (For real Clifford algebras, 
the exceptional case occurs when J = fi2...n = +I. For complex Clifford 
algebras, the exceptional case occurs when J = +I or cil.) 


Proof. With one exception, a product of one or more distinct Dirac matrices 
will anticommute with at least one Dirac matrix. To see this, we note that 
a product of an even number of Dirac matrices will anticommute with any 
Dirac matrix appearing in the product. We also see that the product of an 
odd number of Dirac matrices will anticommute with any Dirac matrix which 
does not appear in the product. The one product which commutes with all 
Dirac matrices is the product 7,; _, where n is odd. The proof now proceeds 
by self-contradiction. Suppose the products are not linearly independent. In 
that case there exists a set of coefficients 472--J (not all zero) such that 


Adis, — 0. (11.3) 


k=0 fijo... jk 


2... Jk 


If the coefficient of J in Eq. (11.3) is not zero then one can divide Eq. 
(11.3) by that coefficient and obtain the equation 


T+ Y Bile deg, 4 = 0 (11.4) 


where the sum does not include the identity matrix. If the coefficient of J in 
Eq. (11.3) is zero, one can pick out a term with a non-zero coefficient 
(Say Pmims...m,) and multiply the equation by (?,,,,...,,) * which equals 
E aim2...m,- IN this fashion one can always obtain an equation of the same 
form as Eq. (11.4) from Eq. (11.3). 

If the B72---J?s are all zero, we already have the desired contradiction. 
If the sum in Eq. (11.4) contains a product (say $,,,,...,) which anti- 
commutes with ?,, then one can multiply Eq. (11.4) on the left by f» and on 
the right by (fm) ! and obtain 


BEY Bue n ija.. m) = 0. (11.5) 
We note that 


dud une ala) © = e asco Oy le = aa eee 


Therefore we can add Eq. (11.5) to Eq. (11.4) and thereby obtain a new 
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equation which, except for a factor of 2, is identical in form to Eq. (11.4) 
except that at least one less term (7,,,,...,,,) will now appear in the sum. 

If n is even, this process can be continued until the sum reduces to zero 
and thus obtain the contradiction I = 0. If n is odd the process can be 
continued, until we have 


I t aJ — 0. 


Thus we see that our desired contradiction occurs unless J is a scalar multiple 
of I and so the theorem is proved. E 


The class of possible nonuniversal Clifford algebras is narrowed by the 
following theorem. 


Theorem 11.2. If the Clifford algebra generated by an n-dimensional system 
of orthonormal Dirac matrices is not universal then n is odd. Furthermore, 
if the Clifford algebra is real and not universal then p — q — 1 is an integral 
multiple of 4. 


Proof. The first sentence follows from the proof of Theorem 11.1. The second 
sentence follows from the requirement that J? = I. To see this, we first note 
that if J = $1993... În, then 

J = (=D 9,980135... Pn- 


1 
= (OT YO DS n1123- daa 


Thus 

J? = (1) DG Paha Pa Pn In—1 «++ Pahoa) 
or 

J? = (—1)""7 9 — DU = (—1)!9975(—1) 4I. (11.6) 
Since J? = I, 


4n(n—1)—q=2k for some integer k. 
Since n is odd, n = 2m + 1 for some integer m, this last equation becomes 
(2m + 1)(2m) — 2q = 4k or 2m — 2q = 4(k — m°). 


Since 2m =n—1=p+q-—1, this last equation becomes p—q—1= 
4(k — m?). E 


After a few more paragraphs, it will be shown that for pseudo-Euclidean 
spaces of any finite dimension with any signature, it is always possible to 
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construct a system of Dirac matrices such that 7,54... is not a scalar multiple 
of I. This does not mean that nonuniversal Clifford algebras do not exist. 

To see this, suppose the system {f,,92,...,f2_} generates a universal 
Clifford algebra. Then if we define ?,,,., = 7175... Pam. it is not difficult to 
convince oneself that the system {,,)2,---5 fom. Pam+it generates a non- 
universal Clifford algebra. It can be shown that any nonuniversal Clifford 
algebra can be constructed in this manner. (See Problem 11.1.) This means 
that any nonuniversal Clifford algebra associated with a Euclidean or 
pseudo-Euclidean space of dimension n (or 2m + 1) is isomorphic to a 
universal Clifford algebra associated with a Euclidean or pseudo-Euclidean 
space of dimension n — 1. 

We are now in a good position to consider the problem of constructing 
matrix representations for universal Clifford algebras. Probably the easiest 
way to construct explicit representations is by using the Kronecker product 
of matrices. Suppose A is an n x n matrix and B is an m x m matrix. The 
Kronecker product Ao B is an mn x mn matrix. In particular if A = [a;;], 
then the Kronecker product is defined by the partitioned matrix: 


4,,B a,,B ... aiB 
d3,B aB ... aj,B 

AoB=| © 7 MNT (11.7) 
annB a,4B ... AmB 


For example, if 
A T e and Ke A 
421 22 b, b; 


di,b;,. ibiz izbit a,b; 


then 


d,,b;; a,b; a,b, ab 
a21b11 45b; ajb, ab; 


a21b23 45,b;; a5;b;, a5b; 


The properties of the Kronecker product are thoroughly discussed in a 
book entitled Kronecker Products and Matrix Calculus with Applications by 
Alexander Graham (1981). 

It is not too difficult to show that 


(Ac B(C«D) = ACo BD (11.8) 


where (Ac B)(C » D) represents the ordinary matrix product of (A B) with 
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(CoD) and in a similar fashion AC and BD are respectively the ordinary 
matrix products of A with C and B with D. 
Furthermore, it can be shown that 


Ao(BoC) =(AcB)oC. (11.9) 


Now we have the machinery to construct explicit matrix representations 
of Dirac matrices for a nondegenerate vector space over either the field of 
real or complex scalars. Actually we will construct matrix representations of 
the Dirac matrices in C, , and then note that the same Dirac matrices can 
be used for R, ,. The type of Clifford algebras which will be treated first are 
of the type C, 9 where p is an even integer. 

To carry out this construction, we will use the Pauli spin matrices: 


0 1 0 —i 1 0 
6, = 1 0 > 0, = i 0 , and 83 = 0 E . 


For C; o, we can use ?, = o, and ?; = o2. For C, 9, we can let $, = 04°64, 
f2 01905 13 = 01°03, and f4 = 63° I. 

This process can be continued by induction. Suppose it is possible to 
construct a matrix representation of size 2" x 2" for C5, ;. Suppose also 
that we designate the matrix representation of the Dirac matrices for this 
space by 


$,2m) for k= 1,2,...,2m. 


We can then obtain a matrix representation for the Dirac matrices of C3,,4 2,0 
as follows: 


(2m + 2)2c,»$,0m) fork =1,2,...,2m, (11.10) 
Some (2m + 2) = i"o, o J(2m), (11.11) 
Jam 29m + 2) = 0201, (11.12) 


where 


J(2m) = $,(2m)$,(2m) . . . Fam(2m). 


(The factor i" that appears in Eq. 11.11) has been chosen to guarantee that 
(am+1(2m + 2)? = +1) 

It is not difficult to show that the system of matrices just constructed is 
indeed an orthonormal system of Dirac matrices, where (j,)? = +I for 
k= 1,2,..., 2m + 2. (See Prob. 11.2.) From Theorem (11.1), this is all that 
is required to generate the universal Clifford algebra C,,,42,9 since 2m + 2 
is an even integer. 

To construct the Dirac matrices for universal Clifford algebras of the 
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type C, , where p + q = 2m is now a simple matter. One can take the Dirac 
matrices constructed for C,, and simply leave the first p matrices 
unchanged and then multiply each of the remaining q Dirac matrices by 1. 

If p + q = 2m, the elements of C, , form a vector space of dimension 
2?" Over the field of complex numbers, the set of complex matrices of size 
2" x 2" also form a vector space of dimension 2?". Thus it is clear from the 
above construction that if p + q = 2m, the Clifford algebra C, , is isomorphic 
to C(2") where C(2") designates the algebra of complex matrices of size 
2n x 2m 

It turns out that all complex universal Clifford algebras for non- 
degenerate pseudo-Euclidean spaces of a given dimension are the same 
regardless of the signature of the metric. We have just shown that to be the 
case when the dimension of the pseudo-Euclidean space is even. It will be 
left to you to show that if p + q = 2m + 1 then C, , is isomorphic to ^C(2") 
which designates the direct sum C(2") ® C(2"). (Problem 11.3.) 

For real Clifford algebras, the situation is considerably more compli- 
cated. The classification of these algebras will be discussed in detail in the 
next section. 


Problem 11.1. Suppose the system ($,525,...,)54,41)] generates a non- 
universal Clifford algebra. Show that if any single Dirac matrix is eliminated 
from this system then the resulting system of 2m Dirac matrices generates a 
universal Clifford algebra which is isomorphic to the given nonuniversal 
Clifford algebra. (This shows that if ( p, q) is the signature associated with a 
real (complex) nonuniversal Clifford algebra then that nonuniversal Clifford 
algebra is isomorphic to the universal algebra R,.., , (C,-1,,) if p # 0. The 
same nonuniversal algebra is also isomorphic to R, 4-1 (C, ,..,) if q #0.) 


Problem 11.2. Show that 
Pem + 2)? (2m + 2) + $;(2m + 2)),(2m + 2) = 2041, 
where $,(2m + 2) is defined by Eqs. (11.10)-(11.12). 


Problem 11.3. Suppose the Dirac matrices for C,,, o are designated by $,(2m) 
for k = 1,2,..., 2m. Then define 


$.Qm + 1) =03°9,(2m) fork =1,2,...,2m, (11.13) 
and 
Pom+1(2m + 1) = (—i)"03° J(2m). (11.14) 
(1) Show 


92m + 1)9,m + 1) + ?,0m + 1)9 {2m + 1) = 2 yl. 
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(2) Show 
J(2m + 1) = i"o4»]. 


(Parts (1) and (2) show that the set ?,(2m + 1) generates the universal Clifford 
algebra C;,., o.) 


(3) Use the result of part (2) to show that 


$Qm) : 0 
IU + (—i)"J(2m + 1)]9,(2m + I =]oee Dee 
o xj 
and 
0: 0 
ALL + (—i)"JQ2m + 10D]j,Qm-1)2|:s:666 
0 : fm) 


(This result should convince you that C3,,41,9 is isomorphic to the 
A 0 : 
algebra of matrices of the form E B where A and B are arbitrary 


complex matrices of size 2" x 2". This algebra is known as the direct 
sum of C(2") with itself. This particular direct sum is designated by 
C(2") D C(2") or more simply by ?C(2"). 

Adjust the definitions of Eqs. (11.13) and (11.14) so that the resulting 
Dirac matrices are suitable for C, ,. Then convince yourself that the 
basic result of part (3) is valid for C, ,, that is C, , is isomorphic to 
2C(2"), where p + q = 2m + I. 


(4 


— 


11.2 The Classification of all Real Finite Dimensional Clifford. Algebras 


In the last section, it was shown that the universal Clifford algebra C, , is 
isomorphic to C(2") if p + q = 2m and isomorphic to ^C(2") if p - q — 
2m + 1. The classification of the real Clifford algebras is more intri- 
cate. William Kingdom Clifford made some headway on this task (1882). 
Although he recognized that it was possible for ?,54...,, to bea scalar multiple 
of I, it appears that he failed to recognize that not all Clifford algebras are 
uniquely determined by the requirement that 7,7, + frf; = 2nyJ. It is quite 
possible that this prevented him from making more progress on this matter 
during his lifetime. Since his death, this problem has been solved. 

In a paper dedicated to Arnold Shapiro, Michael F. Atiyah and Raoul 
H. Bott published a paper in which all real Clifford algebras of types R, 9 
and Rp, are classified (1964). Their methods are extended to all Clifford 
algebras in Topological Geometry (Porteus 1981, pp. 240—251). The results 
are most neatly summarized in a paper by Pertti Lounesto (1980). 
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To present the methods of Bott and Atiyah, it is useful to introduce a 
few formal definitions and some additional notation. An algebra 4 is said 
to be generated by a subset S if every member of A can be expressed as a 
linear combination of finite products of the elements of S. (Such expressions 
may not be unique.) We will write 


A = [a,25....a,] 
if A is generated by the set 
S ead cce An}. 
A set of generators is by no means unique. Thus we can write 


R,, = [fi Pas Par. - +s VA 


or alternatively 


ada A ^ x 2 
Rpa = [95725315 faiz 9512-5 Pnad 


Another key definition is that of direct product or tensor product. An 
algebra Æ is said to be the direct product or tensor product of two 
subalgebras B and D if 


(1) A is generated by the elements of B and D. 
(2) dim A = dim B dim D. 
(3) Vb e B and d € D, bd = db. 


If these three conditions are satisfied, then one writes A = B ® D. If the field 
of scalars is unclear from the context, this is incorporated into the notation. 
For example, if the field of scalars is the field of real numbers R, one writes 


A-BGg D. 


In the context of matrices, the term Kronecker product is frequently 
used as a synonym for direct or tensor product. This may seem strange since 
the Kronecker product is not commutative. For example o,°03 Æ 63°04. 
However, 


(cio IXI 003) = (o03)(0,»1). 


For this reason, it will become apparent that it is only a slight (if any) abuse 
of terminology to use the term Kronecker product as a synonym for direct 
product. 

We are now in a position to apply the methods of Bott and Atiyah to 
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classify all real Clifford algebras. We first note that 


A. x " x A 
Îi 923312 9412 15125 - +> 9n12] 


L 
= [2,5 $2] Q P312 9412 $5125 Put]. (11.15) 


If (? = (}2)? = +I, then 


Ga) =T —3a2?21& = — (f). 


If the 4,,.’s are treated as the Dirac matrices of a Clifford algebra and p > 2, 
then 


[312 Yarn Ys12 Tria] = R, p-2- 


Thus in this case, Eq. (11.15) becomes 
Roa = Ryo © Ry, p-2- (11.16) 
If p 2 1 and q > 1, we also have 


Rog = [Pir fn Pains 9319 Paim- +s Pen- Dind 
= [LPi Pad Q (Pains Paine aiwm- În- 11a] 
This time 
(fa, == — PainTnik = t, 
so 
Rpa = Ri @ R, aL (11.17) 
In a similar fashion if q 2 2, then we can show 
Rpa = Ro,2 @ Ry-2,p- (11.18) 
From Eqs. (11.1) and (11.2), we know that R, o = R,, = R(2) and 
Ro, = H. Thus, we already see that if p+q is even then R,, can be 
decomposed into a multiple direct product of R(2)’s and H's. The labor of 
classifying all Clifford algebras is substantially reduced by the consequences 


of the following theorem. 


Theorem 11.3. R,, 44,44, = Rp, Where k is any positive or negative integer 
consistent with the requirement that p + 4k > 0 and q — 4k 20. 


Proof. To prove this theorem, it is only necessary to show that R,,4 4.4 = 
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R, a Applying Eqs. (11.16) and (11.18), one gets 
Ry+4,q-4 = R, 0 ® Ry-4,p+2 = Ro e Ro,2 e Ry, q—4- (11.19) 
Applying the same two equations in reverse order, one also gets 
Rp, = Ro,2 © Ry-2,p = Ro,2 @ R,, Q Rpg- (11.20) 


Our desired result is an immediate consequence of Eqs. (11.19) and 
(11.20). 


This last theorem implies that for any value of p + q, we need only 
compute R,,, ,,., for k = —1,0, 1, and 2. However, even this task is cut 
almost in half by the following theorem. 


Theorem 11.4. For a fixed value of p-- q, R,, is an even function of 


2 pad 
p— q — 1; that is 
R p+a+1)2+(p-4-1)/2,(p+4-1)/2-(p-4-1)/2 
= Rip+a+1)/2-(p-4-1)/2,(p+4-1)/2+(p-4-1)/2 
or 


R 


pa — Paei,p-i- 


Proof. Following the approach of Porteus (1981, p. 248), we note that 


Rp = [Pis Pai Pais Pars - +> Pl. 


If (?,)? = +J, then (Pea)? = — fr Pir = — (P). Thus if 5,7; Pais- + +> Pat 
are treated as Dirac matrices, it is clear that they generate R,,, ,. ,. a 


From this last theorem, Rm+2,m-2 = Rm—1,m+1 and Rm+i,m-1 = R,,. 
Thus if p + q is even we need only compute &,., ,,.,, for k — 0 and —1. 
The simpler one is Rm, m. By repeated application of Eq. (11.17), we have 


Rim = R,Q,GR,,G- . -@R, 
or 
Ram = R2) & R2) Q- - -Q RQ) (11.21) 
where each factor appears m times. 
To simplify Eq. (11.21), it will be shown that 


R(r) Q R(s) = R(rs). (11.22) 
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To prove Eq. (11.22), let E,,(r) denote the r x r matrix with 1 in the ith row 
and jth column and 0 in all other positions. Also let 7, denote the r x r 
identity matrix. Then 


R) & R(s) = [E1 r) e, E15) eL, ... Ero. ..] 
© [1,» Ej, (S... Lo E, S) ...] 
=[..., E,(r)o En (s),---] 
= R(rs). 


From Eqs. (11.21) and (11.22), we now have 
Rim = RQ"). (11.23) 
To compute KR, , ,,,,, we note that from Eq. (11.18), we have 


Rm-1,m+1 = Ro,2 Q Rn—1,m-1 = HG RQ" 7). (11.24) 
In general 


HG R(r) = [1eL, iol, fol, keLJ@L..,1°EW,..J = HO) 


which is the algebra of r x r matrices whose components are arbitrary 
quaternions. 
From Eq. (11.24), we now have 


R 


m 


i my1 = H" S). (11.25) 


Let us now turn to the real universal Clifford algebras for the case that 
p + qis odd. From Theorem 11.3, it is only necessary to compute Rm+1+k,m-k 
for k = —1,0,1, and 2. From Theorem 11.4, Rm+1+k,m-k = Rm+i-km+k 
Thus we need only compute R,., , ,,,,,, for k = 0, 1, and 2. 

We note that 


Rm+i+k,m-k = [Pi Pas ett, 32m 41] 
= [Po fa- -o Pomi 7] 
EE [J] & [Fos yz" t j2m4 il. (11.26) 


Because of the values of k which are under consideration, we may require 
that (7,)* = +I. In that case Eq. (11.26) becomes 


Rm+1+k,m-k = [J] @ Rn tigm—x- (11.27) 
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From Eq. (11.6), 


J^2(-pgetimmo( pap (o9 DOL Dg = (-1y(-D 9 
= (-1H. 


For both odd and even values of k, [J] is a 2-dimensional algebra. For 
even values of k, J? = +I and we can represent J by the matrix 


Fa 
0 =i] 
In that case [J] may be regarded as the algebra of 2 x 2 matrices of the 


a 0 
form i ;| whre a and b are real numbers. This algebra is sometimes 


denoted as the direct sum R @ R or more simply by the symbol ?R. 

For odd values of k, J? = —I and we can represent J by i. In this case 
[J] may be identified as the algebra of complex numbers C. In this context, 
C is a 2-dimensional algebra since 1 and i are linearly independent over the 
field of real numbers. 

Now we are in a position to complete the computation of the real 
universal Clifford algebras for odd values of p + q. By successively substitut- 
ing k — 0, 1, and 2 into Eq. (11.27) and using Theorem 11.4 where needed, 
we have 


Rm+i,m = ^RG Eu 


Rn+2,m-1 = CG Rmt+i,m-1 T CO R, m 
and 


Rm+3,m-2 = ^RG Rm+2,m-2 = ^RG Rm-1,m+1: 
Using Eqs. (11.23) and (11.25), these equations become 
Roos = °R Q RQ"), (11.28) 


Ru+2,m—1 = C & RQ"), (11.29) 
and 


Rn+3,m—2 = R Q HQ" t). (11.30) 
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Now 
^R RO) lh 1 p | | Lek 
r= oL, ol, sS °F Ar)... 
i-a a e 2* Esr),...] 
Ej): e][o: o 
0 : of Lo : Ej 


which equals the algebra of real matrices of the form 
Eu 
0 B 
where A and B are arbitrary real matrices of size r x r. This algebra is 
generally designated by the direct sum R(r) ® R(r) or more simply by the 
symbol ?R(r). 


Applying similar arguments to the tensor products that appear in Eqs. 
(11.29) and (11.30), we have 


Ruin = ? R(Q"), (11.31) 
Rm+2,m-1 = CQ"), (11.32) 

and 
Rn+3,m—2 = 7HQ"~*). (11.33) 


All universal real Clifford algebras have now been computed. The results are 
summarized in Table 11.1. 

Using the results obtained for the real universal Clifford algebras, it is 
not too difficult to classify the real nonuniversal Clifford algebras. (See 
Problem 11.4.) The classification of the real nonuniversal Clifford algebras 
is summarized in Table 11.2. 


Problem 11.4. Use the results of Problem 11.1 to construct Table 11.2 from 
Table 11.1. 


Problem 11.5. Show that a nonuniversal complex Clifford algebra can be 
constructed for any signature (p, q) such that p + q = 2m + 1. Show that 
any such Clifford algebra is isomorphic to C(2"). 


Table 11.1. Classificaion of all real universal Clifford algebras: R, ,. 





ip —q—1+ 8k 0 1 5 3 4 





p+q=2m RQ") HQ") 
p+q=2m+1 2R(2") cQ) 2gQn-1) 
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Table 11.2. The classification of all real nonuniversal Clifford algebras: R,, ,. 
Ip -4—1 t 8k = 0 4 


p+q=2m+1 RQ") HQ"-!) 





11.3 The Classification of all c-Unitary Groups 


Among the Clifford numbers belonging to any complex Clifford algebra C, , 
are the c-unitary Clifford numbers. As you may recall, a Clifford number U 
is c-unitary if UU* = I where U* is the complex reverse of U. The set of 
c-unitary Clifford numbers belonging to C, , form a group which is hereby 
designated as the c-unitary group CU( p, q). 

In the discussion of the Clifford number solutions to Dirac's equation 
for the electron, it was noted that expectation values are invariant when a 
solution is multiplied by a c-unitary Clifford number. Thus c-unitary groups 
may form the basis for some useful gauge theory. With this thought in mind, 
this section is devoted to the task of classifying these groups. 

Perhaps the easiest c-unitary groups to deal with are those of type 
CU(2m, 0). In the first section of this chapter near Eq. (11.10), a scheme was 
presented for constructing matrix representations for Clifford algebras. In 
that scheme 


fí(2m) = e,m,, om, o: -omy, (11.34) 


where m,, = I(2), 01,0, or e and c, = 1, — 1, i, or —i. 

For C;, o. $x)” = +I for each value of k. This means that c; is real for 
each value of k. Furthermore the Kronecker product of Hermite matrices is 
Hermite. Since the Pauli matrices are Hermite, it follows that for C;,, , the 
matrix representation of Eq. (11.34) results in Hermite matrices for all of 
the Dirac matrices. In this situation it is a trivial task to show that taking 
the complex reversal of a Clifford number is synonymous with taking the 
complex transpose of its matrix representation. In particular, if 


A= Y Alta eg ufi, Fig 
then 
E AL »3 P b AER - tjt 
=} Aap... nfa = AT. 
Since taking the complex reversal is synonymous with taking the 
complex transpose of its matrix representation, it is obvious that the group 
of c-unitary matrices CU(2m, 0) may be identified with the group U(27)- -- 


that is the group of unitary matrices of size 2" x 2”. 
For Clifford algebras of type C;,,. 1,0, the situation is only slightly more 
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complicated. Here again computing the complex reversal is synonymous with 
computing the complex transpose of the matrix representation presented in 
Section 11.2. However, here the group of unitary matrices consists of those 
to be found in the algebra C(2")G C(2"). This group is denoted by 
U(2") ® U(Q"). Members of this group have the form 


pp 
where U, and U, are unitary matrices of size 2" x 2". 
For C, , where q # 0 and p + q = 2m, the situation is substantially more 
involved. In this case Eq. (11.34) remains valid but c, = +1 only for 


k=1,2,...,p.Fork=p+1,p+2,...,p+4q, c= +i. This means that 


9*(2m) = 9,(2m) fork =1,2,...,p 
and 
$t(2m) = —$,2m) fork =p+1,p+2,...,p+q. 
To deal with this case it is useul to introduce a matrix M which commutes 
with the first p Dirac matrices and anticommutes with the last q Dirac 
matrices. 
If p is odd, let 
M = fis (11.35) 

If p is even, then q is aiso even. In that case let 

M = Îp+1fp+2--- Îp+a (note: q #0). (11.36) 
We now have 


$,(2m)M = ${(2m)M = MjfQm) fork = 1,2,...,2m. 


The reader should not have trouble generalizing the second equality to any 
Clifford number, that is 


AM = MA*. 
This means that 


UUM = UMU*. 
Thus 
U is c-unitary e UMU* = M. (11.37) 
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From the Kronecker product construction of M, it is clear that either M or 
iM is Hermite. If M is not Hermite, multiply M by i and relabel the resulting 
matrix by M. This will not modify Eq. (11.37). However, the adjusted M 
will have the property that M? = +I. This means that the eigenvalues of M 
will be +1. From the Kronecker product construction of M, it is also clear 
that the trace of M is 0. This means that there exists a unitary matrix P such 


that 
I 0 
PMP* = | | 
0 -I 


where I is the identity matrix of size 2" ^! x 2™7 !, 
If we replace each of the Dirac matrices 9, by P9, P*, the Clifford algebra 
remains the same but Eq. (11.37) becomes 


JEN PEE 


The group of matrices that satisfies this relation is denoted by U(2"^?, 2"7 1). 
Thus if q # 0 and p + q = 2m, then 


CU(p, q) = UQ" 1, 2771), 

For the case where p + q = 2m + 1, the situation is even more involved. 
However, it is still possible to construct a matrix M that commutes with the 
first p Dirac matrices and anticommutes with the last q Dirac matrices. If p 
is odd, let 

M = $1;...,Qm + 1). 


To get a useful matrix representation for M, let us construct C, , from C, 
that is, let 


»q— 1? 


$m + 1) = 03° (2m) fork =1,2,...,2m 
and 
Pom+1(2m + 1) = co, ° J(2m) 
where 


c is chosen so that [}2,,4,(2m + D)? = —I. 
Since p is odd, it is clear that 


MQm + 1) = 052515... (Qm) = e,» M(2m). 
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After the same adjustments used before, we can replace M(2m) by a diagonal 
matrix of size 2" x 2" with an equal number of +1’s and —1’s on the 
diagonal. 

Under this circumstance, the group of matrices that satisfies the equation 
UMU* = M is UQ" 1, 2" 1!) @ UQ" 1,2"^5, 

Members of this group are matrices of the form 


EE 
0 U, 
where U, and U, are members of the group U(2" !, 2"), 
Finally, we have the case where p is even and q is odd. To deal with 


this case, it is useful to identify the members of C, , with linear combinations 


of even order forms in C,, , ,. In particular let 


jQm + 1) = ij Qm + 2)%,.,(2m +2) fork=1,2,...,2m+1. (11.38) 


It is not difficult to show that elements on the right-hand side of Eq. 
(11.38) generate forms of even order associated with C,,, ,. Also the 
mapping preserves both the operation of complex reversal and complex 
transpose. This means that if we find a matrix M is C,,, , such that 


AM = Maf* 


for an arbitrary Clifford number . in C,,,,,, the same relation will hold 
when the .’s are restricted to forms of even order in C,, , ,. Since these 
even order forms in C,..,,, correspond to the forms of arbitrary order in 
Cpa this M will serve our purpose. 

What makes this situation somewhat different from those previously 
considered is that this time the matrix M does not belong to the matrix 
representation of C, ,. This is because it is a form of odd order in C, , ,. 


In particular, let 
M = fim + 2)5;m + 2)... 3, 0m + 2). (11.39) 


Examining the construction scheme outlined near Eq. (11.10), it becomes 
obvious that 


M = (oy tto W= oW. (11.40) 


(Since q is odd, q > 1 so $5, ;(2m + 2), which equals o,°J, does not enter 
into the product on the right-hand side of Eq. (11.39).) The matrix W is an 
invertible matrix of size 2?" x 2?", Now with our representation, any 
member of C, , must be of the form 


lo al 
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From Eq. (11.40), the matrix M must be of the form 


PR 


Thus the equation, “UMU* = M", becomes 


AWB* =W (11.41) 
and 


BWA* = W. (11.42) 
From Eq. (11.41), B* = W^!A^!W and 
B= W*(A*) «W*) +. (11.43) 
From Eq. (11.42), we have 
B-W(A*) !W'!. (11.44) 


Equations (11.43) and (11.44) agree without imposing any restriction on 
(A*) ^! since from the Kronecker product construction of W, we know that 
W* = +W. The only restriction on A is that (A*) ! must exist. Thus any 
member of CU( p, q) myst be of the form 


A 0 
U- | | (11.45) 
0 W(A*) Wo? 


where A is any complex invertible matrix of size 2" x 2", There is an obvious 
correspondence between matrices on the right-hand side of Eq. (11.45) and 
matrices of the form [A]. This correspondence is actually an isomorphism. 
Thus the group CU(2k, 2m — 2k + 1) is isomorphic to the group of invertible 
matrices of size 2" x 2" with complex components. This group is known as 
the general linear group of 2" x 2" matrices over the complex numbers and 
is denoted by GL(2”, C). 

To summarize the results of this section, CU( p, q) denotes the group of 
c-unitary elements of C, , and 


CU(2m, 0) = U2"), (11.46) 
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CU(2m + 1,0) = UQ") & UQ”), (11.47) 
CUQm — k, k) = U(2"71,2"71) fork > 1, (11.48) 


CUQk + 1,2m — 2k) = U(2"7!, 27-1)  U"-1,2"-!). form» k, 
(11.49) 


and 


CU(2k, 2m + 1 — 2k) = GL", C). (11.50) 


Problem 11.6. Show that for the mapping of Eq. (11.38), 


($,2m - 1)? =I fork—1,2,...,p 
and 


($,(2m +1)? = —I fork=p+1,p+2,...,2m+1. 
Problem 11.7. Consider the mapping 
92m + 1) = tifa 422m + 2)Pom42(2m +2) fork =1,2,...,2m+ 1. 
Show that this mapping generates an isomorphism between the forms 


of arbitrary order in C, with the even order forms in C,,,,, where 
p+q=2m+1. 


APPENDIX 


A.1 The Product Decomposition of Restricted Lorentz Operators and 
Related Operators 


This section is devoted to some particular factor decompositions of restricted 
Lorentz transformations and related operators. However, the computational 
techniques introduced in this appendix may have equal importance. 

The reader may first ask, what is a restricted Lorentz transformation? 
In Chapter 2, boost operators and rotation operators for Minkowski 4-space 
were discussed. One may ask, what happens when one takes a product of 
several boosts and rotations? One knows that the composition of any number 
of rotations can be expressed as a single rotation. However, it is not difficult 
to show that the composition of two boosts is not another boost unless the 
directions of the two boosts are the same. 

Both a boost operator and a rotation operator is a linear combination 
of p-vectors of even order. Thus any product of boosts and rotations must 
have this same property. In addition for both a boost and a rotation operator, 
one can compute the inverse operator by reversing the order of the Dirac 
matrices. It is not difficult to show that this property must also hold for an 
arbitrary product of boosts and rotations. 

Suppose /* designates the Clifford number obtained from . by 
reversing the order of the Dirac matrices in s. For example, if 


A = al + by. + cf20  d?123 + CPor23, 
then 
A = al + bj; + Choo + dfa21 + ef s210 
= al — biz — cf20 — d$123 + Cfo123- 
of is called the Clifford reverse of »7. We note that 


^ 


LO Pia 3, 0.0 «++ Pin dN = Orn o TRO, Fn Prd 
= Orri e Pind OF tee PAE 
307 
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The reader should not have difficulty generalizing this first to a product 
of two Clifford numbers and then to a product of any finite product of 
Clifford numbers. That is 


(ABY — 28 fd and (AA... AY = A)... el. 
If oZ] = 0,1 fork = 1,2,...,n, then 


(AA... A) = Al... AA 
— i.f buf. 
= (AA... A). 


Thus, if Z is an arbitrary product of boosts and rotations then * = Z-t, 

It will be shown in this section that if Z is an operator composed of a 
linear combination of even order vectors and Zt = ^! then Z can be 
factored into a product of a single boost and a single rotation in a unique 
manner. Such an operator ¥ is a restricted Lorentz transformation. Such 
transformations are said to be restricted because as a set they do not include 
reflections or translations. 

In this section, I will discuss not only the factorization of restricted 
Lorentz operators mentioned above but other factor decompositions not 
only for restricted Lorentz operators but also for slightly more general 
operators which must be dealt with in discussing interpretations of Dirac's 
equation for the electron. 

To deal with the computations necessary for any of these factor 
decompositions, I have found it useful to use a theorem due to Joseph H. 
M. Wedderburn (Albert 1939, p. 39) on the structure of algebras. When this 
theorem is applied to the Clifford algebra of Minkowski 4-space, we find 
that it implies that this particular algebra can be decomposed into a direct 
product of an algebra generated by quaternions and the algebra of real 2 x 2 
matrices. This factorization is not unique, but one factorization that is 
particularly useful is 


LI, 225. $31. $12] X [5 fos J, J$o9] where J = $9125. 


Note several properties. First: every element in the first set commutes with 
every element in the second set. Second: linear combinations of elements in 
the first set form the algebra of quaternions. (This is obvious since we can 
identify fa, with i, ?4, with j, and 5,5 with k.) Third: linear combinations of 
elements of the second set form the algebra of real 2 x 2 matrices. This is 
true since we can represent 


cof awd 
Pe. ue d wr 
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0 1 1 0 
J by | i || and J} by B. cel 


In this representation 


a+d b+c 
al + bĝo + cJ + dJjo = : 
b—c a—d 


Thus the algebra generated by I, fo, J, and J, over the field of real numbers 
is isomorphic to the algebra of real 2 x 2 matrices. 

Any Clifford number associated with Minkowski 4-space can be written 
as a linear combination of elements formed by taking products of elements 
in the quaternion algebra with elements in the 2 x 2 real total matric algebra. 
For example an arbitrary boost operator 28 can be written in the form 


48 — I cosh £ + Jk sinh Ý = exp(19JÁ) (A1.1) 

where 
k = k'as + k93, + KG, (A12) 
Jk =k. + k20 + K93o, (A1.3) 


and (kt, k*, k?) are the direction cosines of the direction of the boost. 
Similarly, a rotation operator @ may be written in the form 


R = I cos ; +â sin? = exp(0f) (A1.4) 


where fi = n!$,, + n134, + n^5,,, and (n!, n?, n?) are the direction cosines of 
the axis of rotation. 

The bivectors k, Jk, and fi are said to be simple. In general any Clifford 
number is said to be simple if it can be written as a product of vectors. It 
is obvious that Jk is simple since Jk = (i$ Pa K?3, + k*$3)%p. It is less 
obvious that k or A is simple. However, if one chooses two unit vectors 
(p, p?, p?) and (q', q?, q?) which are orthogonal to (kt, k*, k?) and orthog- 
onal to one another then 


(kt, k?, k>) = E (pg? — p?q?, pq! — p! qo, p! q? — p’q') 
and 
= ct(p, + p$ + p3(q i + a7 Po + 979s). 


In the following exposition, I will refer to normalized bivectors similar 
to fi and f as simple unit space—space bivectors and normalized bivectors such 
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as Jk as simple unit space-time bivectors. Note that 
(A)? = —[(m + (y + n] -1. 


The advantage of using Wedderburn's decomposition for computational 
purposes is that one only needs to remember the products of the members 
of the quaternion algebra and the matric algebra separately. For the 
quaternion algebra, one only needs to remember the following equation 
which can be verified by the reader; namely 


Ak = —If-k) -AxÉ (A1.5) 

where 
(A-k) = n!k! + n?k? + m? 
and 
Ax k = (n7k3 — n?k?) faa + (mk! — n'k3)93, + (ntk? — n?k!)j,,. 

(The reader should be careful to distinguish Eq. (A1.5), which is an equation 
for the product of 2-vectors in Minkowski 4-space, from Eq. (1.6), which is 
an equation for the product of 1-vectors in Euclidean 3-space.) 

To carry out computations in the matric component of the algebra, one 
only needs to remember (fọ) = I, J? = —I, and foJ = —J$,. Thus for 
example, (J9(J$9) = —($9J)(J$9) = —Fo(J*) Po = Go)” = I. With a little 
practice this can be done quite quickly in one’s head. 

An example of a slightly more complex calculation follows: suppose M 
is an arbitrary Clifford number consisting of a linear combination of vectors 


of even order. Suppose we wish to compute .///*, where Ml? is the Clifford 
reverse of M. We may represent Min the form 


M= al + bñ + cJÁ + dJ (A1.6) 
where ñ and K are normalized so that AA = kk = —(A-A) = —1. Then 


A^ = al + bât + ck! + ast 
=al + b(—ft) + c( —Ky(J) + dJ 
=al — bh — cJÉ + dJ. (A1.7) 


Carrying out the multiplication, we find that many terms cancel out and we 
get 


ALT = a? + b? — c? — d?) + J(2ad + 2be(A-k)). (A1.8) 


We are now in a good position to prove our first theorem. 
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Theorem Al.J. Suppose M is a real Clifford number associated with 
Minkowski 4-space and is composed of a linear combination of even order 
vectors. Also suppose MM #0. Then “= N.Z J where N is a positive 
normalizing constant, Z^ is a linear combination of even forms with the 
property that LL? = I, and  — exp(aJ/2) = I cos x/2 + J sin «/2. (In his 
book Space-Time Algebra, David Hestenes refers to the operator J as a 
duality rotation (1966, p. 17).) 


Proof. Referring to Eqs. (A1.6)-(A1.8), we can let 
N? = [(a? + b? — c? — d? + 4(ad + belk APT, 


cos a = (1/N?)(a? + b? — c? — q?), 
and 
sin « = (1/N2)2(ad + be(k-A)). 
Then 
AL = N*(I cos a+ J sina) = N? exp(aJ). 
Now let N29? = .4£44t^ = N?exp(aJ) and J = +exp(aJ/2). Clearly the 
inverse operator ,Z ! exists. In fact £^! = +exp(—«aJ/2). Now define 


L = (1/N).4£ ^ !. Then LÝ = (1/N),g Y M. But(Z 1)! = Jt. Also J 
and therefore Z^! commutes with all even order vectors. Thus 


LL? = (1/N?) Ut g 7? = UNAN 7) p 7? =1. 


I leave it to the reader to show that aside from the sign ambiguity J 
and £ are uniquely determined by M. a 

Having factored out Y from .Z we can now turn to the problem 
of factoring the remaining factor Z. 


Theorem A1.2. Suppose # is a linear combination of even order vectors 
such that Z L? = I. Then  — BR where B is a boost operator and R is 
a rotation operator. 


Proof. To get a grip on what we are doing, it is useful to multiply a boost 
and a rotation to see what the product looks like: 


(x cosh $ + J sinn?) Peas” + nea 
2 2 2 2 


0 
Be oss a codi oa ua X eR C uns 
2 2 2 2 2 2 


+ Jf x fisinh 9 sin : — J(m-A) sinh d sin a (A1.9) 
2: 2 2- 2 
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By examining the first two terms on the right-hand side of Eq. (A1.9), one 


can determine the rotation component of any restricted Lorentz operator 
by inspection. Thus if 


L =al 4 bh + cJÉ 4 dJ 





i 


then 
0 
acu eR 
2 2 
where 
0 a . 0 b 
cos — = — — and sin- = —————. 
2 a? + b? 2 a^ + b? 


To show that the factoring can be done, we only need to show that 22^! 
is a boost: 


ggi-—lo- (al + bá + cJk + dJ)(al — bh) 
Ja? +b? 
! 2 2 ( A 
= — — [(a? + b*) I + J(ack — bekñ + ad — bdAy] 
a? + b? 





i 


A ia [(a? + b?) + Jack — bek x A — bdñ) 


Ja? +b 


+ J(ad + be(k-A))]. (A1.10) 


Since Z Lt = I, it follows from Eq. (A1.8) that ad + be(k-f) = 0. What 
remains on the right-hand side of Eq. (A1.10) looks hopefully like a boost. 
To push this further, let us examine the magnitude of the bivector. Suppose 
v = ack — bc(k x ft) — bdf. Then 
(0:0) = ac? (k-k) + b?c7|k x Al? + bd? (ñh) — 2acbd(fi - k) 
= a?c? + b?c?(1 — (k-A)*) + bd? — 2ad(bc(A- k). 


Using Eq. (A1.8) again, we have bc(k-A) = —ad and 


(0-v) = a?c? + b?c? — a?d? + b?d? + 2a?d? 
= a?c? + b?c?  a?d? + b*d? 
= (a? + b*)(c? + d°). 
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Then 





(ack — be(k x ñ) — bdft) = (a + b?)(c? + d?) 


where 7f is a simple unit space—space bivector. Using this result Eq. (A1.10) 
becomes 





LR — fat x BH eH dH. 


To identify this as a boost operator, we need to make the identification that 


Ja +b = cosh( $) and Jc? 4-d^- si (7) 


for some appropriate value for $. Since cosh?($/2) — sinh?($/2) = 1, our 
desired identification is possible only if (a? + b?) — (c? + d?) = 1. However, 
looking at Eq. (A1.8) again we see that 2 = I implies that (a? + b?) — 
(c? + d?) = 1. Thus we do indeed have our sought-after boost operator. W 


It should be noted that a restricted Lorentz operator can also be factored 
with the rotation operator on the left side. In that case the rotation operator 
will be the same but the direction of the boost operator will be different 
unless the direction of the boost is the same as the direction of the axis of 
rotation. To see this we note that 


Bim, PR, 0) = R(R'BR) = RBH, p) 
where 


f = RÂ, DARA, 0). 


Combining Theorems A1.1 and A1.2, we see that if MMM 40, then 
M= NBR, where N is a positive normalizing constant, B is a boost 
operator, @ is a rotation operator, and 7 is a duality rotation. To complete 
the picture, let us consider the case where .4///! = 0. 


Theorem A1.3. Suppose Mis a linear combination of even order p-vectors 
associated with Minkowski 4-space and ./4/^ = 0. Then “= NRP, where 
N is a positive normalizing constant, 2 is a rotation operator, and F is a 
projection operator of the form i(I + Jm) where fà is a simple unit 
space-space bivector. Furthermore N, @ and Y are unique. (An operator 
P is said to be a projection operator if P? = P.) 


Proof. By examining a product of the form NRP, one can pick out the 
rotation operator by inspection and the rest rapidly ensues. Suppose 
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R = exp(40f) and P = 1I + JM). Then 
N 0 . 0 0 .. 0 : 
voto - P res + Asin +3 m cos — + (A x f) sin — ET : 
2 2 2 2 2 2 


Thus if A — al + bñ + cJk + dJ, inspection of the first two terms tells us 
that 





b 
se eu = 
No UN xw 
To obtain the projection factor, we compute 
1 
RIM = —  — — (al — baal + bh + cJÉ + dJ) 


/q? + bp? 
1 ‘ ; 
0 [(? + b?)I + J(ack — bef x k) — bdñ) 
/ q? + b? 
+ J(ad + bc(A-£))]. 


Following steps virtually identical to those in the proof of Theorem A12, 
we get 


gp iu — FEX [(a? + b2)1 + (c? + dP) JA] 
Ja + b? 
where 
c ack — be(h x k) — bah 
J (a? + bà»? + d? 








However, using Eq. (A1.8), ./4/^ — 0 implies that (a? + b?) = (c? + d?). 
Thus RIM — NÀ(I + Jm) where N = 24/a? + b?. m 


A.2 The Exponential Representation of Restricted Lorentz Operators 


In the first section of this appendix, it was shown that a restricted Lorentz 
operator can be decomposed into a product of a boost and a rotation 
operator. In this section, it will be shown that a restricted Lorentz operator 
can be represented as a single exponential operator with a bivector argument. 
In proving this assertion, it will be shown how to construct the representa- 
tion. From this representation, it will be shown how to read off the 
eigenvectors. Our first theorem to be proven is: 


Theorem 42.1. Suppose ¥ is a restricted Lorentz transformation. In 
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particular suppose  — al + bi + cJk + dJ. Then one of the two following 
cases hold. 


(1) (bf + cJÉ)? = 0 and 





L = exp(bf(I — Jm)) or L = —exp(— bÂ — Jm)) 
= +I + bad — Jh), (A2.1) 
where m = ñk =A x k and the three entities k, ih, and A form a 
right-hand system of orthonormal space-space bivectors. If we 
identify k = $54, i = $4,, and Â = $,,, then 
L = exp(bf,($o + $2) or L = —exp(— b( (0o + 22). 
(A2.2) 
(2) (bh + cJk)? #0 and 


L = exp(3(o — 0J)2), (A2.3) 


where 2 is a “complex multiple" of (bñ + cJk) such that (2)? = I. 
To say 2 is a “complex multiple" of (bñ + cJk), I mean that 
2 = (1/r) exp(—aJ/2)(bA + cJÉ), where exp(—aJ/2) is considered 
*complex" because in this context J is algebraically equivalent to 
./ —1. For this second case it is possible to express . in the form: 


L = explo + 0524). (A2.4) 


However, in general, this form is valid only in a boosted frame and 
not in the frame of the observer. 


Proof for case 1. 


L — (al -- dJ) + (bâ -- cJk), Lt =(al + dJ) — (bh + cJh), 
and 


LL? = (al + dJY! = (a? — d?)1 + adJ = I. 


Since a? = d? + 1, we know that a #0 and since 2ad = 0, we know that 
d = 0. It then follows that a = +1. Furthermore (bñ + cJk)? = —b? + c? — 
2bcJ(fi-k) = 0. Thus either both b and c are zero or neither is zero and 
(A-k) = 0. In the first case Y = -- I. In the second case, c = +b. Absorbing 
the sign ambiguity of c into the definition of k, we have 


(b + cJk) = (bá + bJÉ) = bA — JAK) = bA — Ji) 
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and 


L = +14 dA — Ji) = texp(+ bad — JM). 


The reader should check that Å, ri, and A form a right-handed system of 
orthonormal bivectors, that is Ax m=mh=—Ah=k, ñx k= ñk = 
—kh =m, and k x ih = ki = —mk = f. 


Proof for case 2. For case 2, it is helpful to observe that in this context J is 


algebraically equivalent to ./ — 1. Thus we can use our knowledge of complex 
numbers to carry out our computations. We first observe that (b + cJ k}? = 
(—b? +I — 2be(k- f)J which is much like an ordinary complex number. 
Thus if we let 


(—b? + c?)I — 2bc(k-8)J = r?(I cosa + J sina) = r? exp(aJ), (A2.5) 


and define 


S - (bi + cJÁ) ew (7). (A2.6) 


then it is not difficult to show that (2)? = I. 
With these definitions, we have 


L=al+dJ+r exp( 7): (A2.7) 
or 
L=A+4+ Be. (A2.8) 
We observe that since L £t = (A + B2)(A — B2), 
A — Bg - Ir. (A2.9) 


From Eq. (A2.9), we can identify A and B with the hyperbolic cosine and 
sine functions with “complex” arguments, that is 


A = cosh(3(¢ — 0J)) (A2.10) 
and 
B = sinh(á($ — @J)). (A2.11) 


With these definitions, we have 


L = cosh(3(¢ — 0J)) + 2 sinhG(¢d — 6J)) 
= exp(i(ó — 0J)2). (A2.12) 
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To determine $ and 0, we can expand the right-hand sides of Eqs. (A2.10) 
and (A2.11) and compare the results with the right-hand side of Eq. 
(A2.7). Thus we have 


A-I cost( 2) cosh A7) sn (?) sinn( 
2 2 2 2 
=I cosh( $) sed) —J sinn( sin($) 
2 2 2 2 


— al 4 dJ, (A2.13) 
B=! si (^) so (A7) — cosh($) s^) 
2 2 2 2 
=I sn(?) se) —J cost($) s (7) 
2 2 2 2 


= Ir cos(5) + Jr a (2). (A2.14) 
2 2 
Thus we have 


cost( $) (^) =A, snh( sin(> = —d, (A2.15), (A2.16) 
2 2 2 2 
sinh( cos(5) =F (2). (A2.17) 
2 2 2 
cost( 2) s (^) = -r sa(2) ] (A2.18) 
2 2 2 


Taking ratios of these equations, we have 








and 


and 











ann 2) E a PEGA (A2.19) 
2 a r sin(x/2) 
and 
w^) NN SU MM CN (A2.20) 
2 a r cos(a/2) 


The argument ¢ is uniquely determined by Eq. (A2.19). The argument @ is 
uniquely determined by Eq. (A2.20) if we require that —n/2 < 0/2 < 1/2 
when a is nonnegative and 1/2 < 0/2 < 3n/2 when a is negative. 

We now turn to the problem of showing that we can write Z as 
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exp(i($f,g + 02,,)). To do this we have to examine the form of ê more 
closely. From Eq. (A2.6), we have 


1 ^ n 
a= |^ (5s +e sin( 2) +J (-» s (2) Ttc (2) 3] (A2.21) 
r 2 2 2 2 


Or 
2-x4Jy 
where 
i- HC ex); tc (2) e) (A2.22) 
and 


-lf ponta 
»- b sin ; Jn ee2)£). (A2.23) 


From these equations, 


UZD (e2 a h2) un & ^ aff af%\ an & 
r*(x*y) = (c^ — b  sn( 7) (2) + bc(fi {co (5) sin (5) 
= l(c? — b?) sin a + be(k-A) cos a. 


From Eq. (A2.5), 


(c? — b?) = r? cosa and be(k-A) = —4r? sin a. 
Thus 
(3j) =0. (A2.24) 


In addition 


1V[5?(14- cosa)  c?(1 — cos a) 





+ be(k-A) sin a| 


2 3 222 : 
S p 5 : cos a + bei) sin | 








or 
b +c? — 4? 


(%-%) z 


(A2.25) 
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A similar calculation shows that 


b? +c? + 7? 


52 (A2.26) 


y= 


Since the right-hand sides of (A2.25) and (A2.26) are both positive and their 
difference is 1, we can define 


(X: X) = sinh? f and (y: y) = cosh? B. 


Furthermore since x and y are orthogonal space-space bivectors, we can 
choose an orientation for our frame such that 


X = ,, sinh f and y = P23 cosh f. 


This means that 


N> 


= x Jy = 42 sinh f + fio cosh f 
f (Po cosh fi + f sinh £). (A2.27) 


In a boosted frame, we have 


fo = fo cosh f + $; sinh f, 
$55 
92 = fo sinh B + 7; cosh £, 
75 = 35. 
In this frame, we have 
2 = fio and Jê = —55. 


Dropping the primes, Eq. (A2.12) becomes 


L = expG($jio + 0f23)). W (A228) 


Using the representation Z = exp(i($ — 0J)2), one can obtain explicit 
formulas for the eigenvalues and eigenvectors. In particular we have: 


Theorem A2.2. If Z = exp((d — 0J)2), then 


LRP Y) + JfoXy E Jfoy] Z! = e** [9G 3) + JfoXy E Jjoy] (A229) 
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and 
PEPEE) + J)oXy E iI XI LT = e* [9 o(x- x) + Jjoxy + iJfox]. 
(A2.30) 


Proof. To check these equations, one needs to compute Z4, LIP xL", 
LIPIL’, and LJ XPL. To compute 7,4, we note that y commutes 
with both x and y but anticommutes with J. Thus 
LIL" = (exp — 9J)z) Fo exp(— (6 — 94) 2) 
= fo exp(3(p + 0J)2*) exp( — «(9 — 0J)2). 


Here it is understood that 2* = x — Jy. We now have 


LoL = $o[cosh H(p + 0J) + 2* sinh à($ + 0J)] 
x [cosh 4(@ — 0J) — 2sinh 3(¢ — 0J)]. 
We note that 
2*2 = (x — JYX + Jy) = xx + yy + J(Xy — yx) 
L-—(x:x)-(y:y) + 2Jxy. 
Also 
cosh A cosh B = 3[cosh(A + B) + cosh(A — B)], 
sinh A sinh B = 4[cosh(A + B) — cosh(A — B)], 
and 
sinh A cosh B = i[sinh(A + B) + sinh(A — B)]. 
This gives us 
LPL? = $9[3(cosh $ + cosh(6J)) 
+ (Xx) + (i y)i(cosh $ — cosh(0J)) 
— 2Jxyi(cosh $ — cosh(6J)) 
+ 2*4(sinh $ + sinh(0J)) — 24(sinh $ + sinh( — 6J))]. 
We observe that cosh(0J) = I cos 0, sinh(0J) = J sin 0, and sinh(—0J) = 


—sinh(0J) = —J sin 0. Also (y-y) — (x:X) = 1. From these equations, we 
have 


S $94 = $s[(y* y) cosh $ — (X: x) cos 0 
— JXy(cosh $ — cos 0) + JX sin 0 — Jy sinh $]. 
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Multiplying out the ?, and reordering the terms, gives us 
LP? = $4[(y* y) cosh $ — (X*X) cos 0] — JX sin 0 
+ J$oy sinh $ + JfoXy(cosh $ — cos 0). (A2.31) 
Similar calculations give us 
PIPP) L? = —Fo(X-X)(F-¥)[cosh $ — cos 6] 
+ J$9X(y* y) sin 0 — Jfoy(x-* X) sinh 9 
— J$oXy[(X* X) cosh à — (y- y) cos 0], (A2.32) 


S (Jjoy) 4! = (9oCy- y) + JfoXy) sinh $ + Jjoy cosh p,  (A2.33) 
and 


LIX) Lt = —[jg(X*x) + JjgX] sin 0 + Jjg cos 0. — (A2.34) 


If we multiply Eq. (A2.31) by (y-y) and add the result to Eq. (A2.32) the 
trigonometric functions drop out. That is 


S [jo Y) + IPX] Lt = (9oCy* y) + JjoXy) cosh $ + Joy sinh $. 
(A2.35) 


In a similar fashion, if we multiply Eq. (A2.31) by (x*X) and add the result 
to Eq. (A2.32), the hyperbolic trigonometric functions drop out and we have: 


LB (KX) + Jjoxy] LT = ($9(X* X) + JjoXy) cos 0 + Jjox sin ð. (A2.36) 
Adding and subtracting Eq. (A2.33) from Eq. (A2.35), gives us 
S [jo y) + JjoXy t IF IIL = et * [99 y) + IPoXF + Ifo II. 
From Eqs. (A2.36) and (A2.34), we also get 
L [$9 X) + JjoXy t iJjox]d4! = et EE) + JAxy riJjx] Wi 
When Z = I + bf(I — Jm) and (f'm) = 0, the situation is quite differ- 
ent. This operator is described as a null rotation. Such operators for Lorentz 
spaces of arbitrary dimension were discussed in detail by Marcel Riesz (1993, 
pp. 84-127). For this 4-dimensional case, we have 
Theorem A2.3. Suppose # = I + bf(I — Jm) where (f-m) = 0. In this case 
there are only two eigenvectors each of which has an eigenvalue equal to 1. 


One is the space-like eigenvector J})f. The other eigenvector is the light-like 
or null vector ĵo + J$9rfi. The null vector is part of a 3-dimensional subspace 


322 CLIFFORD ALGEBRA 


spanned by Jfom, Jorn, and fo + Jjgrh which is invariant under this 
transformation. Furthermore, suppose (L — I)8 = Zù Zt — 9 where ? is 
a l-vector. Then the image of L — I acting on the 3-dimensional space 
described above is the 2-dimensional space spanned by J}orhA and fo + 
Jorfi. This 2-dimensional plane is described as light-like. It is tangent to the 
light cone and contains no time-like vector. In turn, the image of L — I acting 
on this 2-dimensional space is the 1-dimensional space spanned by the 
light-like vector fọ + Jfomh. Finally the image of L — I acting on this 
1-dimensional space is 0. 


Proof. To prove the theorem one simply shows that (Joft).Z* = Jf of, 


LIIM) Lt = Jj — 2bJj gr — 2b? (9o + Jor), 
S (Jig) Le = Ipoh + 2b()y + Jom), 
and 


Llo +IP) Lt = Po + Ifo. 
This exercise is left to the reader. a 


From Eqs. (A2.34) and (A2.36), we see that when Z = exp(4($ — 6J)2), 
£ generates a rotation of angle 0 in the 2-dimensional plane spanned by the 
two space-like vectors fo(X°xX) + Jjoxy and J?ox. (See Problem A2.1.) 

From Eqs. (A2.33) and (A2.35), it is clear that the same Lorentz 
transformation generates a boost in the 2-dimensional plane spanned by 
the time-like vector ?o(y-y) + Jf xy and the space-like vector Joy. (See 
Problem A2.2.) 

The eigenvectors jo(y:y) + JjoXy + Jjgy are both null or light-like 
because they have length 0. Such vectors are important because they 
correspond to light rays. 

Penrose and Rindler (1984, p. 28) make the observation that in general 
a restricted Lorentz transformation may be specified by taking any three 
distinct null vectors and then choosing the directions of their image null 
vectors. This implies that for a nontrivial Lorentz transformation, at most 
two null vector directions may be fixed. Penrose and Rindler go on to state 
that from topological arguments it can be shown that the direction of at 
least one null vector must be fixed. 

The case where the directions of two null vectors are fixed corre- 
sponds to the eigenvectors ?o(y- y) + JjoXy + Joy for the Lorentz operator 
exp(ó — 6J)2). 

The case where the direction of only one null vector is fixed corresponds 
to the null eigenvector fọ + Jfgrf for the null rotation J + bad — Jim). 


Problem A2.1. Show that the product of the vector fo(X°x) + J# oxy with 
itself is — (X-X). This shows that the vector in question is space-like. 
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Problem A2.2. Show that the product of the vector j?o(y-y) + J?oxy with 
itself is (y+ y). This shows that this vector is time-like. 


Problem A2.3. According to Theorem A1.2, a restricted Lorentz operator 
£ can be written as a product of the form 2222 where B is a boost and 2 
is a rotation. Compute 27 and & where 9 is the null rotation I + bAU — JM). 
What is the angle between the axis of rotation and the direction of the 
velocity for the boost? 


Problem A2.4. Suppose P, = (I + 4), P. =4U — 2, Q, = XI + iJ2), and 
Q =I - iJ) 


(1) Show P, and P. are orthogonal projection operators, that is 
(P,)? = P,, (P_) = P_, and P,P_ = P_P, =0. 

(2) Show Q, and Q_ satisfy a similar set of equations. 

(3) Show I = (P, + P_(Qi + Q.) - P.Q. + P.Q- c P-Q.  P-Q.. 

(4) Show expG(ó — 0.)2) P,Q, = e?? e"? P, Q,. 

(5) Show more generally that 


expG(ó — 0J)2) P.Q.. = (e^ *? P,)(e* "* 9, ). 


(6) Show 
explo — 0J)2) = e* P,Q, c eI p,Q. +e 9*p g, 
+e?) P Q 
A.3 The Bianchi Identity 
From Eq. (5.70): 


If we apply the operator V; to both sides of this equation, we get 


(V;V;V, = V.V, Vj) 4 = —V, Rix) A = FRNA 
+ (VA Ry + AWN, R (A3.2) 


If we now subject this equation to the permutation i > j > k — i twice, we 
get two more equations: 


(VV, V, EE VV, V) = —HV, By) A == SRN A 
+ VAi Ri + ADV; Be: (A3.3) 
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and 
(V, V,V, e V, V; Vi) 4 - AVR, ) A E IR, Vel 
+ (V,A)3R,, + AWN, RB; (A3.4) 


Adding the last three equations together, we see that 
+IVA) Buy + FAV Rig. (A3.5) 


We now pursue the task of obtaining an alternative expression for the 
left-hand side of Eq. (A3.5). To achieve this, we first replace «Z, by V, in 
Eq. (A3.1) and then apply the same permutations that we did before. The 
three resulting equations are 


(V,V.V, — VV V) A = iR V, A + (V AiR 

(V, V,V, — V,V,Vj)ad = —198, V A + (Vis )390,;, 
and 

(V, V;V, — VV, V) A = iR; Vp A + (Vp AIR; 


Adding these last three equations now gives us 
6V, V Vg l = — 388, Va A + AVA) R. (A3.6) 
By subtracting Eq. (A3.6) from Eq. (A3.5), we see that 
0 = -0 VGRA + AAV Ri. (A3.7) 
Since . is an arbitrary Clifford number, it is now clear that 
Ving = 0. (A3.8) 


Equation (A3.8) is the Bianchi identity. However, the usual form of the 
Bianchi identity is 


R 0. (A3.9) 


paliz; k] T 


It is left to the reader to show that Eq. (A3.9) is equivalent to Eq. (A3.8). 
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